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Editor’s Foreword 


Addison-Wesley’s Frontiers in Physics series has, since 1961, made it possible for 
leading physicists to communicate in coherent fashion their views of recent 
developments in the most exciting and active fields of physics—without 
having to devote the time and energy required to prepare a formal review or 
monograph. Indeed, throughout its nearly forty-year existence, the series has 
emphasized informality in both style and content, as well as pedagogical clari- 
ty. Over time, it was expected that these informal accounts would be replaced 
by more formal counterparts—textbooks or monographs—as the cutting-edge 
topics they treated gradually became integrated into the body of physics knowl- 
edge and reader interest dwindled. However, this has not proven to be the case 
for a number of the volumes in the series: Many works have remained in print 
on an on-demand basis, while others have such intrinsic value that the physics 
community has urged us to extend their life span. 

The Advanced Book Classics series has been designed to meet this demand. It 
will keep in print those volumes in Frontiers in Physics or its sister series, Lecture 
Notes and Supplements in Physics, that continue to provide a unique account of 
a topic of lasting interest. And through a sizable printing, these classics will be 
made available at a comparatively modest cost to the reader. 

These notes on Richard Feynman’s lectures on Statistical Mechanics were 
first published some twenty-five years ago. As is the case with all of Feynman’s 
lectures, the presentation in this work reflects his deep physical insight, the 
freshness and originality of his approach to understanding physics, and the 
overall pedagogical wizardry of Richard Feynman. This volume will be of inter- 
est to everyone concerned with teaching and learning statistical mechanics. In 
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addition to providing an elegant and concise introduction to the basic concepts 
of statistical physics, the notes contain a description of some of the many orig- 
inal and profound contributions—ranging from polaron theory to the theory of 
liquid helium—that Feynman made to the field of condensed matter physics. 


David Pines 
Urbana, Illinois 


December 1997 
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CHAPTER 1 


INTRODUCTION TO STATISTICAL MECHANICS 


1.1 THE PARTITION FUNCTION 


The key principle of statistical mechanics is as follows: 
If a system in equilibrium can be in one of N states, then the probability of 


the system having energy E, is (1/Q)e~ *"*", where 


k = Boltzmann’s constant, and 7 = temperature. Q is called the partition 
function. 

If we take |i) as a state with energy EZ; and A as a quantum-mechanical 
operator for a physical observable, then the expected value of the observable is 


<A = LY Gillie", 
Q jb 


This fundamental law is the summit of statistical mechanics, and the entire 
subject is either the slide-down from this summit, as the principle is applied to 
various cases, or the climb-up to where the fundamental law is derived and the 
concepts of thermal equilibrium and temperature T clarified. We will begin by 
embarking on the climb. 

If a system is very weakly coupled to a heat bath at a given “‘temperature,”’ 
if the coupling is indefinite or not known precisely, if the coupling has been on 
for a long time, and if all the “fast” things have happened and all the “slow” 
things not, the system ts said to be in thermal equilibrium. 

For instance, an enclosed gas placed in a heat bath will eventually erode its 
enclosure; but this erosion is a comparatively slow process, and sometime before 
the enclosure is appreciably eroded, the gas will be in thermal equilibrium. 

Consider two different states of the system that have the same energy, 
E, = E,. The probabilities of the system being in states r and s are then equal. 
For if the system is in state r, any extremely small perturbation will cause the 
system to go into a different state of essentially the same energy, such as s. The 
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E,=H,+E,=H,+E, 


Fig. 1.1 Energy levels in a system S and a heat bath H. 


same is true if the system is in state s. Since the system remains in contact with 
the heat bath for a long time, one would expect states of equal energy to be 
equally likely. Also, states of different energies would be expected to have 
different probabilities. 

Because two states of the same energy are equally probable, the probability 
of a state having energy, EZ is a function only of the energy; P = P(E). 

Now consider a system, S, in equilibrium with a large heat bath, H (see 
Fig. 1.1). Since experience shows that the behavior of a system in equilibrium is 
independent of the nature of the heat bath, the bath may be assumed extremely 
large and its total energy E very great. Also, the possible energy levels of the 
heat bath may be assumed quasi-continuous. 

Let the energy levels of the heat bath be denoted by H;. These levels are 
distributed quasi-continuously. Let the energy levels of S be denoted by E;. 
Then H; > E; for all i, 7, The bath plus the system can be thought of as a new 
system, 7, which is also in thermal equilibrium. 

T has a definite energy, but as that energy is not fixed exactly (the bath is in 
contact with the outside world), we can assume that the energy may be anywhere 
in the range E, + A. If A is sufficiently small, we can assume that the states of 
the heat bath are equally likely in the range H; + A. Let n(#,) be the number of 
states per unit energy range in the heat bath H around energy H,. 

The probability, P(E,), that S is in a state with energy E, is proportional to 
the number of ways S can have that energy. In other words, it is proportional 
to n(E, — E,)2 A = the number of states of H that allow T to have energy in 
the range Ey + A. Then 


P(E,) a n(Eo = E,) 
P (E,-) n(Eo a= Ey) 


— ginn(Eo~ Er)— Inn(Eo— Er) 
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Remember that EZ, « E,. If it is true that (d/dE) In n(E) = BE) is almost 
constant for E in the range under consideration,* then we can say 


P(E) _ o~ B(Er— Er’). 
P(E,) 
SO 
P(E,) oc e7 BF’, 


Normalization requires that P(E;) = (1/Q)e~*# where Q = }); e7**. 

The fundamental law has just been shown to be quite plausible. But for 
those who doubt the constancy of B(E), let us consider some examples. 

First, assume that the heat bath consists of N independent harmonic oscil- 
lators. The energy of the bath is 


N 
F = y n,;hw;, 
=1 


where we assume that 7; is very large, and we neglect the zero-point energy. 
How many states are there with energy less than F? If N = 2, we have the 
situation shown in Fig. 1.2. 

For N = 2, the number of states is proportional to the area of the triangle. 
Clearly the number of states with energy less than F for large F is proportional 
to F’ ; so the number of states per unit energy range is n(F) oc (d/dF)F® oc FN7?, 

In n(F) = constant + (N — 1) In F, 
dinn(F) N-1_ N-1 


ow 
fw 


dF F E, 


since E, « Eo. 


* The basic assumption made here is, that the system governing the probabilities has 
a quasi-continuous spectrum in the region considered and for which there is no partic- 
ular characteristic energy. For such a system, if ¢, and €, are two energy values, then, 
since energy is only defined up to an additive constant ¢, we should have 


f (és) _ fe; + €) 

f(e2) fe, + 8)’ 
where f(e) is the probability. Defining 

f(e) = g(e — 2&2), 
we obtain 

g(e)g(é1 — &2) = gO)g(é, — &2 + 8), 
which is (uniquely) solved by 
g(e) = gO)e"** = (B constant), 

that is, 

FE) _ -pter—e2) 


f(é2) 
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Fig. 1.2 States with energy less than F for two independent harmonic oscillators. 


For N large, 


1 1 
energy per oscillator Ww’ 


where W = energy per oscillator. 
Alternatively, we can go back to the equation 


P(E,) _ n(Eo — E,) 
PEE,) (Eo — Ey) 
and plug in N directly. We get 
n(Eo — E,) _ (Ey — E,)""* 
n(Eo ~ E,’) (Eo = E,)\~* 


(1 — E,/NW)N-1 7 #l® j 
= as N OO. 
(1 — E,|NW)8-} e-Brl® ~ 


We then get the same value for f as by our previous method. 
As asecond example, consider the case of a heat bath consisting of N particles 
in a box. 
_PL+P2+P2  P2+PL+P2 |, Pht Ph + Ph, 
2m 2m 2m 


F 


Assuming periodic boundary conditions, 


2nh(integer) _ 2n,0h 


P, = 
L L 


* Problem: Why is W independent of N? 
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ne 


JF (Lm/sh) > (ni +3)? 


ny 


Fig. 1.3 Number of states with energy less than F is roughly equal to the area of the 
circle shown. 


where L is the length of the sides of the box. Then 


2nh - 2 2 2 2 
F = [mi + my + My + Me, bo + May, 


If there were just two 7’s, we would calculate 6 with the help of Fig. 1.3. 
The number of states with energy less than F is roughly equal to the area of the 


circle with radius J F(Lm/nh). For N particles, we must use a hypersphere in 3N 
dimensions, and the number of states with energy less than F is proportional to 


(/F)3". It follows that B is roughly constant, and equals 1/(2W), where W = 


energy per particle. 
Consider two independent systems S, and Sz with energy levels A; and B,. 


The probability of system S, having energy 4; is 
e BA: 
bere 
Now place S, and S; in loose contact with each other and consider the combined 
system S; = S, + Sg, with combined energy 7, = A; + B,. 


e brlAi + B;) 


P (Aj) 7 


P,(T,) = P(A; + B,) = Ferd Sg Pri 


e brAi e brB; 
7 ae e brai >; e ArB; : 
The probability of system S; being such that system S, has energy 4; is 


e brai e brB; —BrAi 
a en ae 


F SS e BrAi >; e brB; >; e brAi 
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Similarly, 


P p(B A) = Ye PrBi . 
We see that if two systems are placed in loose contact with each other, in 
equilibrium they have the same B. Temperature has a similar property, and in 
fact, by the way temperature is conventionally defined, B = 1/kT where k is 
Boltzmann’s constant.* 
From the basic principle of statistical mechanics, once 


= VS eg EnlkT 
o-% 


is known, all thermodynamic properties can be found. We define F, the Helm- 
holtz free energy, so that 


Q = pa e7 EnlkT = eri. (1.1) 
F= —kT1inQ = —kT In (x ~ (1.2) 
S = entropy, is defined as —k ¥° P, In P,, (1.3) 


* Furthermore, suppose our spectrum has a density of states n(e) which is relatively 
constant when ¢ — fe, that is 


n(Be) = n(e)q(B) (*) 
Then U(f) (see 1.7) = C/B and we recognize in C the usual (total) heat capacity, so that 
C, @ CO _ ata 

BiB B 


which is the well-known (experimental) formula. 
Equation (*) has the unique solution 


n(e) = Ae" 


3 


which is, indeed, the form of n(e) for microscopic bodies (see pp. 3, 5). 
Also, it can easily be seen (from 1.7 below) that 


dU _ ti >; (E, pees Em) ~e7 Peent em) < 0, 

dB 207 nim | 
so that U is a decreasing function of 8. Suppose we mix two systems having different 
B’s, B; < B,. After mixing there will be a common 8, according to 


U,(B,) + U2(B2) = U,(B) + U,(8), 


whence, since the U’s decrease, 8, < B < f,, and energy flows from the low-f body to 
the high-8 body. This also fits into our intuitive perception of temperature. 


1.1 The partition function 7 


where 
P, = = eW EnlkT (1.4) 
Q 
From Eq. (1.2) it can be seen that 
OF 1 — Ep,/kT —E, ] 
cana fees i —k —e " ——* —In@Q = (1.5) 
(ir), ~ ~#B[G6" Lae 
P = pressure = py — P, on = — (*r), (1.6) 
U = average energy = =» Een? (1.7) 
But 
0Q 1 —Ep,/kT 
nae E,e te 
6T kT? 2 
2 
_ kT 0Q_ _ p20 (F ee eee (1.8) 
Q oT OT \T 0T o(i/T) T 


since S = —OF/OT, U = F + TS. 


aU OF 0(,,0F\ OF OF 02F 
Co) ee a a es 
oT), OT OT\ OT) OT OT oT? 


2 
__7T a (1.9) 
V 


To get a clearer idea of the nature of pressure, consider a possible alternative 
definition. The Hamiltonian operator for the system is dependent on the volume. 
Set H = Hamiltonian = H(V). We can take as a pressure operator Pop = 
— OH/OV. 


P= Z Y <ilPopli>e” F/*". 
OK 
Our alternative definition is equivalent to the first one if 
., OH ,. OE; 
<H ay? eve 
But 


OE, gy FeV) = ECV) _ iy SUED = CIID 
OV yay V'’—V Viov V'—-V 


where |i’) is the eigenvector of H’ corresponding to |i>, the ith eigenvector of H. 
H’ = H(V'’). 
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H’ = H + (H’ — @); so we can apply first-order perturbation theory to 
say that <i’| H’ |i’) ~ Gi] H|i) + Gi| A’ — H|i) for HW’ -— Hwx0. Then 


Fr jim SE — BID gy OF py, 
OV vay V'—V OV 


Our two definitions are equivalent. 

Because H may be a function of the shape of the system, as well as its volume, 
our definition of pressure might depend on how the volume is changed. In 
general, for any parameter a, there is a force that can be computed by 


1 OE;\ _-x: 
Force), = — ¥.( — —!) eT Br, 
Q 2 ( eS) 


and we can write 
OE; 
ba 


A third definition of pressure is 


= Gi Sli. 


0U 
P = V, ° 
a S) 


That this definition is equivalent to the other two may be verified without diffi- 
culty from the equation U = F + TS. 

The equation S = —(0F/0T), holds only at equilibrium, where F is defined. 
Away from equilibrium, S always increases with time. To see this, note that 
time-dependent perturbation theory gives 


dP, 
dt 


where | V,,,,7 is the probability per unit time of transition from state n to state m, 
and |Vonl? = |Vinl?. Then 


aS _ ey (tf 


= »; (Viunl7 Pn ras |Vinnl” Pan)» 


ah Ky In Py 
dt 


since 


dP, d | 
i peal = 0. 
» d a dP 
Then 


° = —kY |Vgl2(P, — P,) In Py 


— >) Venl2(P, — P,)(In P,, — In P,). 


1.2 Linear harmonic oscillators 9 


But each term in the sum is negative, for the sign of P, — P,, is opposite to the 
sign of (In P,, — In P,). So dS/dt > 0. 

Before we start to make calculations, note that if we have a system that is a 
combination of several independent subsystems, with E,..4, = E, = Li Fa, = 
>;; energies of the subsystems, 


F = -—kT In Q = —kT In ¥\ e 8" = ee fo Bi In Y eB Xi En, 


1 ,M2, 0% 


-kT in J] » aia! = -KTY E Y eh “ EF 


The free energy of the whole system is the sum of the free energies of its non- 
interacting parts. | 


1.2 LINEAR HARMONIC OSCILLATORS 


Consider a system of harmonic oscillators in thermal equilibrium. The partition 
function Q, free energy F, and average energy of the system of oscillators can be 
found as follows: The oscillators do not interact with each other, but only with 
the heat bath. Since each oscillator is independent, one can find F;, of the ith 
oscillator and then 


M 
F= > fF, 
i=1 
(M oscillators). 
QO, = Ye Batt (1.10) 
E} = h(n +4) — from quantum mechanics (n = 0, 1, 2,...) (1.11) 


QO; = e hi@i(n+ 1/2)/kT 
=k 


e~ hai/2kT 1 — ho; /kT — 2hew;/kT 
reared (era ie ea il sla = 


how; 


F, = —kT InQ, = —! + kT In(1 — e7*kT) (1.13) 


U; = average energy of a single oscillator in thermal equilibrium 


1 i 0.6 OF, 

~lygzie-mer_ 9 1.14 
0, aai/T) T ere 
ho, hae?" — ha, ho, 


ar i a a 
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FS) 2, ke cae | (1.16) 


ho; 
Um HU BL + ef 


It is customary to define an average n,;, n;, according to 


U; = (n, + 4)ha;. 
Thus 
s 1 
n, = pont | : 


Returning to the example on p. 3 we see that for very high temperature 
(and U) we indeed have 
Wx U,;, = kT = 1/8, 
which is independent of U. 
Note that the contribution to F of the ith oscillator is negligible if iw; > kT 
(except for the iw,/2 term). At low temperatures the high-frequency modes are 
“frozen out” and do not contribute to the specific heat. 


1.3 BLACKBODY RADIATION 


In dealing with blackbody radiation, our point of view will be as follows: In a 
cavity (blackbody), there are a great number of modes of oscillation. The 
number of modes per unit volume per frequency bandwidth is given by classical 
considerations. Each mode, however, behaves as an independent quantum 
harmonic oscillator, except that the Aw@/2 term is neglected. That is, £, = nho. 
We want to get rid of the Aw/2 because it leads to infinite energy when there 
are an infinite number of modes. A Hamiltonian that eliminates the hw/2 is 


H, = 4(P? + w?q?) — 


With the above assumptions, we can obtain an expression for the energy 
per unit volume per unit frequency. 
First we will find the number of modes per unit volume per frequency (or 


wave number). 
Assume a gigantic box of dimensions a, b, and c. The demand is made that 


the waves be periodic at the walls of the box. 


number of waves/cm in x-direction 


* | pe 
I 


# 


number of waves in box (x-direction). 


gr | 8 
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Because of the periodic boundary condition, 


a 
— =n, = an integer. 


Ax 
Let k, = 2n/A,. Then 
ak,, 
es Ny, 
T 
and similarly 
bk, ck, 
oo Ny, = Nn, 
T 27 
a dk, _ dee: b dk, _ dn, c dk, _ ii 
21 21 27 
d*n = dn, dn, dn, = abc d*k : “caste th dh, 
(27) (27) 


For each k, there are two possible polarizations. Thus, the number of modes 
per unit volume with wave number between k and k + dk is 
2d°*n_, d*k 
(abc) (27)3 
Now qo = kc where cis the velocity of light. Also, the number of modes is SO 


large that the sum over the modes can be replaced by an integral. 
3 
oat (1.17) 


oe \\| k,T in ( — exp bec) 
V k,T (27)° 


(Remember that the Aw/2 term has been omitted.) 
From symmetry, 
2 
—_ 21k,T In{ 1 — exp GEN) \ dak se 
V k,T (27)? 
U_ D ho(k) exp [ —fa(k)/k,T ] 4nk? dk (1.18) 
V 1 — exp [—fa(k)/k,T] (22)? — 


= hkc/k,T. Then 
4 fa —-x 
U 8x (k,T) | e ee 


Vo (Qn)? Re? o l1-—e 
n°(k,T )* : 
see 7 OT 
21,4 
ake (1.19) 


159303" 


Let x = ho/k,T 
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The preceding results can be summarized and put in a more familiar form by 
replacing iw with hv. 
The number of modes/unit volume between k and k + dk is 
4Ank? dk — k* dk _ 8nxv* dv 
(2n)° 1 c3 
since k = 2nv/c. The average energy of an oscillator of frequency vis hv/e"”*7—1. 
Thus, the energy per unit volume between v and v + dv is 


b] 


2 
_dU _ 8nv* hvdy 7 (1.20) 


This is the Planck Radiation Law, and is the same as Eq. (1.18). 
[v. dy = oT* 


is the Stefan Boltzmann Law. 
For T very large (kT > hy), 


2 2 
ee (1.21) 
ce} 1 + (hv/kT) -—1 c 


This is the Rayleigh-Jeans Law. 
Let F* = F/V and U* = U/V. 


is the heat capacity of a gas of photons in equilibrium with the container. 

If an oscillator (or mode) is excited to the Nth level, that is, if E = hw,N, 
one says that there are N photons with energy fi@;. Since photons are defined as 
the degree of excitation of a mode, one cannot consider a permutation of photons 
as a new state. That is, photons are indistinguishable. This point is the basis of 
quantum statistics and will be dealt with in a later section. 


1.4 VIBRATIONS IN A SOLID 


We want to find the specific heat of a solid. However, en route to the specific 

heats, we will derive important results that will prove useful in many other cases; 

chief among these results is the calculation of the normal modes of a crystal. 
The program will be as follows: 


1. Consider the solid to be a crystal lattice of atoms, each atom behaving as an 
harmonic oscillator. These oscillators are, of course, coupled. 
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2. Find the normal modes of the system. There are as many normal modes as 
there are degrees of freedom; namely, 3(AN), where (AN) is the number of 
atoms in the crystal. The modes behave as independent quantum oscillators. 


3. Given the modes, calculate F, the free energy. 


4. From F, calculate C, and any other thermodynamic quantities of interest. 
Once F is found, we can quickly calculate U, the total energy of the system. 
In the following derivation, we will calculate C,, directly from U without 
always writing down F first. 


Method of Labeling 


Consider a crystal with A atoms per unit cell. For convenience, assume that the 
unit cell is a rectangular solid of dimensions a, b, c, along three mutually per- 
pendicular axes, x, y, z. Let the origin be at the “‘center’’ of a cell. This cell can 
be denoted by the triplet (0, 0, 0). The cell to its right along the x-axis is denoted 
by (1, 0, 0), and so on. Thus, any cell can be denoted by a vector N = n,a + 
nyb + n,cwherea = ai,b = bj,c = ck. If there are A atoms/cell, 34 additional 
coordinates must be given to locate each atom. Let « denote one of these 3A 
coordinates. 

Call the displacement from equilibrium of the coordinate in the Nth cell 
Z,,n- (We either consider m = mass equal to | or absorb it in Z.) 

Z.,n+m 18 the displacement of an atom in a cell close to N; if, for example, 
there are two atoms A,, A, per cell, the displacement of A, in the x direction is 
denoted by Z, y, that of A, (in same cell) by Z, y, and that of A, in an adjacent 
cell by Z; y+1. 1 = (1, 0, 0) or (, 1, 0), or (0, 0, 1). 


Normal Modes 
T=(1/2) Yio Zin = /2)¥ Z2n (1.22) 
a,N 


particles 
and directions 


V=vo+Y & ) Ze 
Z=0 


a,N a,N 


ee es Ae ee, ae ee (1.23) 
2 «,8,N.M \OZ, y OL Ra S= a,N“B,N+M . 


Assume that the electrons in the crystal always have time to adjust them- 
selves to the configuration with lowest energy, even when the crystal is vibrating. 
In this configuration, there is no net force on the nuclei when the Z, y are 


zero, SO | 
( aV ) =a 
OZ4,N Z=0 
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The additive constant, V(0), will not affect our answers, so we might as well 


drop it. Let 
2 
fore eet = Cu ‘ 
OZ..N OZg.N+M Z=0 


a number that depends on the relative positions of the cells of the two atoms, and 
not on their absolute positions. Note that Cy = C,,™. 

Neglecting higher orders, take | 

V=4t Yo ChZinZenem- (1.24) 
a,B,N,M 

For low temperatures it is not too unreasonable to neglect higher orders, 
because the separation between atoms is of the order of 1 A and at room 
temperature the vibrations have amplitude of the order of 0.1 A. But we should 
not be too surprised if experiment shows our idealization to be false. In matters 
such as the one under consideration, the general approach is to make idealiza- 
tions and then try to find corrections to our assumptions that will give better 
results. 

In order to motivate the procedure that we will use for finding the vibrations 
of a solid, let us consider the classical problem of vibrations of coupled oscil- 
lators. Let the Hamiltonian be , 


ers 


where the g;, are the coordinates of the amount of displacement from equilibrium, 
P; = M,q;, is the momentum, and C;; = Cj}, are constants. To eliminate the 
constants M,, let 


P}? 


1 Ci t bd 
OM, 24 9:4} 


qi = giv M, and Ci; = ails: i 
/M.M,; 
P, = ae ene 1 P! 
04; JM, 
Then we get 
Pe. o 
i 2 2 ij 
The equations of motion are 
oH . —0OH 
i= = P;, Ey = = 2, Cid 
, oP, 0g; > sa 


We now break the motion of the system into modes, each of which has its own 
frequency. The total motion of the system is a sum of the motions of the modes. 
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Let the ath mode have frequency w, so that 
g® fas eat a”) 
for the motion of the ath mode, with a‘) independent of time. Then 
czas? = )' Cas”. 
J 
The classical problem of vibrations of coupled oscillators has just been re- 


duced to the problem of finding the eigenvalues and eigenvectors of the real, 
symmetric matrix, ||C;,|]. In order to get the w, we must solve the equation 


Then the a{*) (the eigenvectors) can be found. It is possible to choose the a” so 
that 
>, af?al?) = 6,5. 
t 


The general solution for q; is 
qi = 1 Cig’, 


where the C, are arbitrary constants. If we take Q, = C,e7'°“, we get q; = 
X, a4O,. From this it follows that 


pa ag; — > aalQ, = > b4jQa ae Q,. 
Making the change of variables, 0; = >°; a{/q,;, we get H = >, H,, where 
H, = 4Pz + 4070}. 
This has the expected solutions Q, = C,e7'@*". 


Now suppose we wish to solve the quantum-mechanical problem of coupled 
oscillators. Again we have 


P!? 

= ae Cij i a 

i 2M; d s4idj 
where this time 

_ha 


i Ogi 


P; 
Making the same change of variables as before, we get 
0. = Yala: = ¥ a/M, a 


H =) H,, 
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where 
__ B2 a2 
H, = © + 40202. 
2 002 
It follows immediately that the eigenvalues of our original Hamiltonian are 
E=yY, (N, + 3)ha,. The solution of a quantum-mechanical system of coupled 
oscillators is trivial once we have solved the equation 


0 = det ||C,, — w6,;|| = det ||Cj,//M,M,; — 6,,l). 

If we have a solid with 1(107*) atoms we must apparently find the eigen- 
values of a 107° by 107° matrix. But if the solid is a crystal, the problem is 
enormously simplified. The classical Hamiltonian for a crystal is 

H=4) Zint+4 YO CUZunZpnem 
a,N »B,N,M 


a 


and the classical equation of motion is (using C24} = C;,") 


ZN = pa Caplp.N+M: 
M,B 
In a given mode, if one cell of the crystal is vibrating in a certain manner, it 
is reasonable to expect all cells to vibrate the same way, but with different phases. 
So we try 
Zan = a,( K) e” iat eK: N ; 


where K expresses the relative phase between cells. The e*’% factor allows for 
wave motion. We now want to find the dispersion relations, or @ = @(K). 


wae = Y (CMagel® My eiK-N, 


Yap(K) = Dy Capen’ ™. 
M 


(Note that y,,(K) is Hermitian. See end of p. 18). 
Then w*a, = dg Yag4s, and we must solve the characteristic equation of a 
3A-by-3A matrix: 


Let 


det |lyag — ©76,,|| = 0. (1.27) 
The solutions of the characteristic equation are 
w(K) = w, 
where r runs from 1 to 3A. The motion for a particular mode can be written 
ZAK) = ai (Ke iW giK-N 


where 
rier 
yy agar” = Op. 
(ef 
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Then the general motion can be described by 


Zen as Y C(K) al(K)e7 io (W)" ik -N 
K,r i] 
where the C,(K yv n are arbitrary constants, and y is the total number of unit 
cells. _ 
The factor 1 i y is inserted to make things look nicer later, but it is not 
strictly necessary. 
Let OK) = C(K)e7?®", Q.(K) describes the motion of a particular 
mode. : 
Zen = Y, O,(K)az(K)e™ §(1/y/1). (1.28) 
K,r 
It follows that . 
0,(K) oc Y Z,,nde'(K)e*%, (1.29) 
a,N 


and the Hamiltonian for the system is 


1 é 
H = ro 2s a » OM2 Zana 


= >» 16,(K)? + 0?(K)10,(K)|?]. (1.30) 


If we consider Q,(K) and its complex conjugate to be independent variables, we 
get the same equations of motion from 


H = 2 [10,(K)|? + ©7(K)|0,(K)|?]. (1.31) 


1.5 SPECIFIC HEAT OF A CRYSTAL 


We now want to find F, so we must sum over all possible modes. 
For one mode, F = kT In (1 — e7*@x/'T) There is a quasi-continuum of 
K values, and for each K value there are 3A w{?’s. Assuming the crystal has 
volume V, and assuming periodic boundary conditions, we have in the range K 
to K + dK approximately d3K/(2x)*V modes with a given p. Thus, 
= Kr | y Roly) kT in [tebe SO) as 
V a (22) 


d?K = dK, dK, dK,,. 


{x is shorthand for 
\" a/b m/c 
—n/a \ \ 


(1.32) 
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The reason for the limits —z/a, z/a is as follows: the factor K was introduced in 
eR'N Then eK'N = eiKxnxagiKynyboikanee: nn. n, are integers. But, 


iKyn,a i(Ky + 22/a)n,a iKy,ny,a ,27i 


e = @é = € 4 


No new modes are introduced if K ranges beyond the prescribed limits. 

We now demonstrate that the specific heat of a crystal determined by our 
present method agrees with experiment; that is, at high T, C, = 3R. At low 7, 
C,— Oas T°. 

We choose to look at U rather than at F. First we note that there is a max- 
imum frequency, wy. Two adjacent atoms can be no more than 180° out of. 
phase, and thus the minimum wavelength must be of the order of twice the 
atomic spacing (call it 2a). Thus K,,,, © 22/2a = n/a, and wy = w(Ky). 

First, we consider the high-temperature limit. For kT > hay, all modes are 
excited to approximately the same energy; that is, if there are N atoms, 


ho; ho, h 
U = f + a Se ; + kT 
d 2 » exp (fa,/kT)-—1 2 d X 
= U, + 3NkT = Uy, + 3RT, (1.33) 


if N is Avogadro’s number. C, = 3R for large T. Note that we might as well 
neglect the zero-point energy, Up. 
For very small values of T (kT « ha,), the behavior of C,, can also be 
approximately determined. 
Ignoring the zero-point energy, we see that the contribution to U of the high- 
frequency modes is very small, because 
ey eee ~ 0 
exp (4@/kT) —1 
when ha > kT. We also know that when K is zero, there are three zero-fre- 
quency modes arising from translation of the entire crystal. For small K, there 
should be three very low-frequency modes. To see how those modes vary with 
K, consider y,,(K) for low K. Since 


Yap(K) = )) Carpe ™, 
M 
where the Ci} are real, 
Yel K) = 2 Cpe = 2 Cpa ™ = Yap(K). 
Also, for real (K), y3.(K) = Yga(—K), so Yap(K) = ,,(—K), and 
f(@, K) = det lly ap(K) = wo Sze = det lly pa( KX) aa oO Spall 
= det [lyug(—K) — 075,5ll = f(@*, —K). 
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Then, because f is an even function of K, 
Of - 2 
—_ (w*, 0) = 0 
aK (w*, 9) 


Since the characteristic equation must have a solution with mw = 0 for K = 0, 
f(0, 0) = 0. The characteristic equation becomes 


we .ivee 2 


j + higher order terms, 
Ow? 2! i,j OK; OK; 


0 


so that, for low w and |K| we have 


6*f 1/2 
COS Mi; COS Oj 
— IK i j {0 , 
Oe Daf loa? 


where cos a; are the direction cosines of K. 

In other words, w = |K|V, where V is the velocity of sound, which may 
depend on the direction of K when the eigenvalues of 07//0K,dK,|y are not all 
equal. But, for sufficiently low frequency, V does not depend on the frequency. 
This is the Debye approximation. 

For convenience, assume that the three sound velocities are equal; that is, 
@ = VK. 


hao; ho; 
= d ke : ehoukT _ 4 


Km r2 
hoy 3] ho(K) 42K dK: w 


-¥ 


es K; K =—. 
2 eholkT _ 1 (22)? Vo 


The factor 3 before the integral is there because for each K there are three modes 
with low a. 


Om 3 
U =U, + at ee da, where Ou = VoKmax (1.34) 
; | 


Kynax Can be found by setting the total number of modes equal to 3(AN), where 
(AN) is the number of atoms in the crystal. 


——_ 4nK? dK = Ky 
o (2x)? 2n7 


Kmax 3 Kmax 
3(AN) = 3 d°KV = 3V 

) (2n)° 
Then 


Ky = (6x7p)'/? where p is the number of atoms per unit volume. 
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Let 
| 37 3,3 , 
ee Gece ee Sige @ = KV 
kT T h h° h k 
UU, 3k4*T* (©mMIT x3 dx 
— =— —% + as — U + AT*. 1.35 
Vo OVO? V3 | e* — | . cy 
Here © is the Debye temperature. 
C, = 4AT? (1.36) 


where A > 17k*/10h°V2 as T/O = 0. 
So C, is proportional to T* at low temperatures. 

The T° dependence of the specific heat at low T can be made more plausible 
as follows: We assume that for iw; > kT, the contribution to the energy of the 
@; mode is negligible and for Aw, < kT, the contribution is kT. 

The number of modes with wave number less than K is 


K4nk? ,  K°V 


3V —— 
9 (2x)? 2n” 


At low T, K = w/V>. Therefore the number of modes with frequency less than 
w, is o2V/2n?V3. But ia, = kT or w, = kT/h and 


n = (number of modes with frequency less than w,) = k°T?V/2n7?V3h°. 


The energy varies as 


3 44 
ne Sa ei A 
Vian? hP VB 2n7h?V 3 
And thus C, is proportional to T°. The numerical factor is, of course, not 
correct. 
If the mode with frequency w, is excited to the nth level, E; = fiw;,(n + 3), we 
say that there are n phonons of frequency w; and energy ha, in the crystal. 
Figure 1.4 shows the general form of the specific heat of a solid as a function 
of temperature. 


Example: Assume the unit cell to be a cubic lattice with one atom per cell 
(Fig. 1.5). Each atom behaves as an harmonic oscillator, with spring constants 
k., (nearest neighbors), and k, (next-nearest neighbors). This case is fairly simple, 
and we can simplify the notation: « = 1, 2, 3. 


Z1.N = Xn; Z2N = Yn, Z3N — LN: 
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T/® 


Fig. 1.4 Specific Heat of a Solid. 


We wish to find the three natural frequencies associated with each K of the 
crystal. To do this, we must find C‘, and then y,. In complex coordinates 


V=) Vip 
a,B 
where 
N,M 
For example, 
Vu = 2 She). 0). Carers (1.37') 
N,M 


If we express the displacement of atom N from its normal position as Xy, then 
the potential energy from the distortion of the spring between atoms N andM is 


Fig. 1.5 Cubic lattice with one atom per cell. 
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Ky = K, for N + M a nearest neighbor to N, Ky = As for N + Ma next- 
nearest neighbor. 

In summing over N and M to get the total potential energy we must divide 
by two, for we count each spring twice. If we use complex coordinates, however, 
we multiply V by two to get the correct equations of motion. 


ye 0 eee eee (1.38) 
2 hy AN N+M iM ; 
1 My|? -ys 
Vir == DY Km (Xn — Xynim\(Xn — Xy+m) 
2 NM |M| 
1 Mx\’ 2 * x x 
ae Ky [(XnXn + XnimXn+m) — (XNXnim + XnN+mMXy)] 
2 N.M [M| 
= K Mx\ XnXn — XnX 
» Ku [XnXn nAn+m]- (1.38’) 
N,M |M| 
Comparing Eqs. (1.37’) and (1.38’), we see that 
M M,\? 
K =) cM*® — —K,,|—*). 1.39 
=¥ “(AD , “(mA (1.39) 


Here (My/|M|)* = 1 for M = (+1, 0,0), 4 for M = (+1, +1,0) and M = 
(+1, 0, +1), and zero for the other nearest and next nearest neighbors. So 


C), =2K,+4K,, C#{'%° = —K,, and so on. 


In this way, all the C4, can be found. We can then calculate 
Ya,p(K) = d Cole ™). 


We wish to solve det |y,., — w76,,| = 0. 
For each K, there are three solutions for w. Thus we obtain 3N values of 


w, o)(K). 


1.6 THE MOSSBAUER EFFECT 


If a free, excited nucleus goes to its ground state by emission of a photon, the 
energy of the photon will be less than the excitation energy because the nucleus 
recoils. But if the excited nucleus is in a crystal, there is a finite probability that 
it will emit a photon with the full excitation energy. In other words, there is a 
finite probability that the state of the crystal after radiation will be the same as the 
initial state. Similarly, there is a finite probability that the crystal state will be 
unchanged by absorption of a photon. This effect, called the Méssbauer effect, 
can be discussed in terms of modes of oscillation of a crystal. For example, 
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suppose the temperature is absolute zero. Then the crystal must be in its vibra- 
tional ground state before emission of a photon. We will find an expression for 
the probability that the crystal is in its ground state after emission. 

Let R be the position of the excited atom, and let P be the momentum of the 
emitted photon. We will assume, without proof, that the amplitude for the 
crystal being in a given final state is 


a = Cfinal |Ae'?’®/"| initial). 


Furthermore, we will consider only the vibrational states of the crystal, and will 
neglect any effect due to the fact that the nucleus changes state. 

Let the Mth atom be the one that emits the photon; let Ry y be its mean 
position, and let Z,, be its displacement from that position. 


R = Row + Zy- 


We can take the origin of our coordinate system at the mean position of the 


excited atom. Then Ry y = 0. 
A single one-dimensional harmonic oscillator in the ground state is described 


by the wave function 
1/4 
w=(2)\ e-or2m 
th 


where Q is the position measured in units such that the mass can be set equal to 
unity. The wave function of a crystal in its ground state is a product of the wave 
functions for each mode of vibration. 

Il ewe 7 20)(K)1Q2(K)|/2h 


Werystal = 


th 


Sv 


We wish to calculate: 


— | | | Werystaten et Werystal 


where F is the probability that the crystal will remain in its ground state and C is 
a constant independent of P. As P — 0, F must tend to unity. So, if we can get 
a formula for F without the correct constant in front of it, the constant can be 
determined easily. We will ignore such factors as “A” and “(@/xh)!/*.” 

We take Z, 4 = Z,.9 as the displacement of the nucleus that emits the 
photon. For simplicity, assume the momentum of the photon is in the « direc- 
tion. Then we wish to compute 


FH? | | | (1 exp ea ee (exp [iPZ,,o/A]) 


2 


(1.40) 
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But 
i 
Vn 


Zu,0 a 2 0, K)a,(K) 


SO 


F? oT] || exp og | exp |: io 0.4K) a8) | 40,0) | 
K,r h hs/ | 


ce TT exp [rg] — - 5 aa 


Kr 4ha?(K yy Kr 4ho(K)y 


Notice that 


((Zin)*> = ( Yala *O,(K OK yel&-¥-" ") 
| 


K 
K’, 


) 
= 
~ 


hla (K)|? 


r 2 2 1 = 
aI OF)) = Ona 


K,r 


So we can write 


F = exp i (7) Zia) |. (1.41) 


The value of F can also be found for nonzero temperature. If “7” is the 
number describing the state of the crystal, we must compute 


F, = [ciel |i>|? 


and then 


F = _ ny Fe *i™, 


t© 


iP 
hs/'n 


where |n) denotes the nth state of a single one-dimensional harmonic oscillator 
with frequency w(K). As 7 is very large, we can write 


F,e7F/# = T] 


Sv 


<n| exp 


0.4K) a0) in| exp fear mee | 


kT 
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F; exp Ce = Lees FP <OMK)>4laz(K)P a 
K,r 
x exp 7 + 4) | 
a 1 P2Q2n + 1h, parr |? —(2n + ne) 
©  F aay eM | exp kT ; 


P2(2n + 1), pane —(2n + 1)fio(K) 
Fx rn — eM EDae IO! lex ae 


-T > [1 - P2(2n + ) I a a =e + _ 


K,r n=0 2@ nh 2kT 


P? ae eg holkT 
2 
oc = at 5 | a‘, 1 ae 


. {a e” hol/kT 
- p2 om |ai(K)|? To estore 


As before, we get for our answer 


F= exp |r <Z? >|) (1.41) 


1.7 QUANTUM STATISTICS FOR A MANY-PARTICLE SYSTEM 


Consider a system of N identical particles, and assume that there is no interaction 
among them. Any two configurations of the system that differ only by an inter- 
change of two or more identical particles are regarded as one and the same state. 
Thus the state of the system is determined by giving the number of particles n, 
with energy 6,. 

Our problem is to calculate the partition function Q with the condition 
>) 2, = constant. The values admitted for every n, may be 


(a) n, = 0,1, 2,3,4,... Bose-Einstein case 
(b) n, = 0,1 Fermi-Dirac case 


Bose-Einstein statistics must be used for particles with integral spin (for 
example, He*) and Fermi-Dirac statistics for particles with half-integral spin 
(such as, electrons). For Bose particles, any number of particles may occupy a 
given state. For Fermi particles, however, there can only be one particle at most 
in each state (Pauli exclusion principle). 
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A state of the system is described by the set of numbers 1,, which can take on 
any set of values allowed both by the statistics and by the condition }\, n, = N. 


Q= YY exp (-/ » nate) (1.42) 


n1,N2, eee 


If there were no restriction on the number of particles, we could write 


Q={[] (z cme), (1.43) 


and we would have in the Bose-Einstein case 


iia 


In the Fermi-Dirac case we would have 
Q=[](1 + e7*). 


Unfortunately, Eq. (1.43) is incorrect, because we have an auxiliary condition 
that >, 2, = N = constant. With this auxiliary condition, the problem of 
finding Q becomes much more difficult. It is possible, however, to get around 
the restriction by considering the system of particles to be a box connected to a 
large reservoir of particles (Fig. 1.6), and by assuming that it is possible for 
particles to pass to and from the reservoir. 

We further assume that the statistical mechanics of the total system acts as if 
it took energy p to remove a particle from the box to the reservoir. py can be 
adjusted by, say, a voltage regulator. In the case of electrons in a metal, for 
example, y is the work function of the metal. In general, increasing wu will in- 
crease the expected number, «N), of particles in the box. If we can find (N) as 


Fig. 1.6 System of particles considered as a box connected to a large reservoir. 
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a function of , we can in principle select » so that any desired number of 


particles are in the box. 
For a given p, the energy levels of a particle in the box become ¢, — yp, rather 


than g,. 
By the key principle of statistical mechanics (as applied to the box), the 


probability of the gas having energy E = >, n,(&, — ) is proportional to e~**. 
Qm= YY exp —A[L ne, - )]- (1.44) 


The summation can be made without restriction. We will now show how um is 
determined by N and how thermodynamic quantities depend on Q™ or g = 


—1/B In Q ‘), 
eH =O = YS exp (-6 b nies — “)}). (1.45) 


N1,M2,... a 


Let N = 1%, 


a =P ( n) exp (-6 bP ni(Gn »)) 


= J BN exp (-6 bP ie |) | (1.46) 


But 
(ND = oD New (-0 bic 2 )) , (1.47) 
1 eg 1 o (u) og 
S80) Gi Pan ei 


This equation gives (NV) = <N(u)). Inverting it gives u as a function of (N).* 
Similarly, let (n,) be the average value of n, for a given p (or a given (N)). 
Then <7,) = Og/0f, and <N) = Dig (Ma)- 

For the purpose of computing the probability of a state, we assumed that 
the system acts as if it had energy E = >\,n,(&, — yw). But the pw was inserted 
only in order to provide for the weighting factor that accounts for different 
probabilities of different numbers of particles in the system. For such purposes 


* Problem: The function 


Fi) Se 


v eAleo-H) _ 


has many plus-minus infinity points, and therefore the equation f(z) = N has many 
solutions. Why do we still talk about a unique chemical potential for bosons? 
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as computing the pressure, we do not consider py to be part of the energy of the 
system. 


1 


u=" > ( rte) exp \-6 Pb nies — al = 919 + uN. (1.8) 


Pressure = P = - = (sy d nate) exp |B b nq — alt. 


pcan be treated as a variable or as a function of Vand <N). If we treat it as an 
independent variable, so that g = g(V, yu), we get 


P= — 99 
OV p=const. 


If we treat p (and g) as functions of Vand (NV), that is, with g = g[ V, u(V, <N))], 
then 


= og = P+ <N) On 
OV <N>=const. OV 
is easily shown. Also, 
0 ) 
og aN! . 
ON \y OKN) yp 
If F = F(V,<N)) = 9V, HV, <N))) + (NU, CN), then 
P= ae and w= ce ; 
OV <N>=const. 0<N) V =const. 


Entropy =S = -k > | a In ica: 


N1,M2,.+. 


97 ee 


where 


Erase cate = ga | ~PE Ng(Eq = 0) . 


It is easily shown that 


_ 9g 
7 oT icine: 
In summary: 
g = = InQ@ = 3 In YY exp \-A b NE, — a} , (1.49) 
(n> = ce (1.50) 
08, | 
(Ny = = => <n,), (1.51) 
Ou a 
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u=7 3 ( nate) exp 1-4] nies al} = Td + WN), (1.52) 


P= ‘= : (5 Y rte) exp A b Ce |} 


_ —9g 
av 
— 0g 
éT 


(1.53) 


b] 
m 


(1.54) 


LV 


Soon we will be able to find g for an ideal Bose gas, and then for an ideal 
Fermi gas. But first we must evaluate some integrals. 


1.8 EVALUATION OF INTEGRALS 


We will soon be dealing with integrals of the form [%,, e~* dx and [°,, x7e~%” 
dx. Let us pause for a second to calculate these integrals: 


oe) 2 00 00 
| e* ax] = | e* ax | e-» dy 
oe) oe) baa Se 2n [oo 
| | e499 dx dy = | | e-” r dr dO 
-O Yy-o 0 0 


2n [2 = 
= —4 | | e~"(—2r dr) d@ = [V2]. (1.55) 
o Jo 
| e? dx = Jz. (1.56) 
_ =gx2 1 i er r fe 
| e eeaaee e” y= (y = Vax). (1.57) 
— 00 aj? a 
| x7e~* dx | es Pa ea e~™* dx 
ae ~o da a J-@ 
d a _ paaas 
Sales 7 = 4a79/4/n. (1.58) 
| x3e* dx = 0 (odd function) (1.59) 
00 ; d? 00 _ 
| xte~%" dx = 4 x? e %™ dy = 3q~5/24/q, (1.60) 


and so forth. 
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1.9 THE IDEAL BOSE-EINSTEIN GAS 
From Eq. (1.45) we have 
e7 Po = > exp {—A[n,(e, — w) + no(e. — w) + °°- }} 


N1 NZ, -6- 


ps exp [—Bn,(e, — »)] d exp [ —Bn.(e,-— y)]--° (1.61) 


For a Bose-Einstein gas, n, = 0,1, 2,... 
1 


2 eo Bni(ei— Hu) ar = (Bose Einstein), (1.62) 
1 
-pg 
a e I, — eo Bein)’ (1.63) 
1 ele 
g= ree In (1 — eB), (1.64) 


Consider a particle contained in a box but otherwise free. The number of 
modes in the box with momentum in the three-dimensional region d?p is 
s(d°k/(2n)*)V, where V = volume and s is the number of possible spin states 
(e.g., s = 3 for spin 1 when the rest mass does not vanish; s = 2 for spin 1 when 
the rest mass vanishes, as for photons). The energy is 


Here we have approximated the discrete set of modes by a continuum (which is 
the case for a completely free particle). The sum for g can then be replaced by an 
integration: 


1 —B 2/2m d*p 
g =s—|In(1 — e78P7/?mein) __*_ yp (1.65) 
B (2nh)° 
— Bp?/2m Bu 3 
pe ge (1.66) 
V V op 1 — e bp*/2mobu (27h)? 


For computational purposes, set « = e* and x* = Bp*/2m. Then p* = 2mx’/B 
and dp = (2m/B)'!? dx. | 
Ufc a a Pe ae 
(2nh)° (2nh)> = 2n?h° 


B 
fe ea x? (2m\3? Pe 
p=s| 1 eee | 
1 2m\>"? 26 5—x2 2, — 2x? 
‘aap(#) | | x*ee dx + a’e ax to] 


mkT\>!? a? a? 
(oa) hae 5 (1.67) 
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where we have used {@,, x7e—® dx = (/ sia) ff a), from Eq. (1.58). Letting 


Ca) = 2 = 
we obtain finally 


3/2 
pas sr, baa). (1.68) 


Thus, given p, we can find « = e** (in principle) by solving the equation 


2nh?\3!? p 
C3/2(%) = A =4 T 3/2 


For the total energy of the system we have 


—_ ue) eens 


ere 18 La. y 4? 
1 e BP2/2mobu (2nh)3 


s3kT sr ) VE5/2(0). (1.69) 


For very low p, high 7, or both, p/T?/? is very small; then C, j2(a) is very small, 
and thus « is very small. In this case €3/2(a) ~ a, €5)2(a%) ~ a, so that 

C5 /2(%) ~ 1 

C3/2(%) 
and U/V ~ (3/2)kTp. This condition therefore represents the classical limit. 


Now let us look at the other limit. As T becomes lower and lower, with p 
fixed, €3,2(«) becomes larger and « approaches 1. For a > 1, ¢ diverges. 


1 i 


C3/2(1) = | + 53/2 * 3372 et = 2.612, 
1 1 
C521) = 1 = 552 + 352 tT = 1.341. 


The temperature T, at which a = 1 1s called the critical temperature for 
Bose-Einstein condensation. 


«ND _., — , (mk) 
ela ita (2.612), (1.70) 


2nh? [ pis \?° 
T, = ——[{=—]} . 17 
mk (05) iy 
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The question arises: Why does our analysis break down at T,? The answer is 
that for such low temperatures (T < T,), we cannot replace the sum, Eq. (1.64), 
by the integral, Eq. (1.65). 

Looking back at g and <N) before we approximated all summations by 
integrals, we recall that 


g= » In (1 — eo FteF) = soln (1 — e7 Pg); 


ae — Be; 
i ee Ee ee 
Lu ; 1—e '*a ra e% — ] ie Phebe — | 
For small «, e~ *" is large; the terms with the lowest e, do not contribute 
much to the sum, and we can replace the sum with an integral. When e7 °* is 
small, we cannot replace the sum with an integral because the first few discrete 
terms in the sum are important. Now 


og 1 
(Ma? de, (2 F#eh ta) — 1 
Because <n,» is positive, (¢, — ) must be greater than zero. For <N) to be 
larger than its value at T = T., (€, — y) must be positive, but very small, in 
order that the low-energy terms in <N) be nonnegligible. Assuming no acci- 
dental degeneracy in the lowest level (that is, assuming & # &,) we can conclude 
that, for sufficiently low temperature, 


1 1 
(m1) = hei) — 4 . pbeo-) <No). 


Without any loss of generality, we may take e, as the zero of the energy. Then 


(ng) = — = KT in(1 rae js _*T for large <n). 
e ——_ 


1 <No> <No> 
For low temperatures, p is very close to zero, so for the energy states above é, 
we can neglect p. The sum for (N — no) can be approximated by an integral: 


00 1 dp 1 
IN exe = <N ~~ No> =. >> Pei ae | ae (2h)? eP?/2mkT = | 
3/2 T\3/2 
= 2.612 = sV = <Ny(—) . 
20h T. 


We see, then, that our definition of the critical temperature is such that when 
T < T., there must be a nonnegligible fraction of the particles in the ground 


state: 
<No) = (N) ae AN eg 
= (N)[L — (2/T,)°""]. 
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Bose-condensation Liquid helium 


Fig. 1.7 A Bose-Einstein gas around T, compared with liquid helium around the 
lambda point. 


For T < T, the total energy is 


kT\3/2 C</(1) 
U = 3kTs(™ Vle,(1) = 8kT Nee 22 
2 Ss 4 C5/2( ) yi 5 /9(1) 
= 3kTN,,.(0.5134) 
T 3/2 
= 3kT(0.5134) (7) <NY. (1.72) 


Then C, = 0U/0T o T?! at sufficiently low temperature. 
For T > T,, we have (N) = (mkT/2nh’)?'"¢3,.(a)sV. Using the definition 


of T,, we have €3/.(«) = 2.612(T,/T)*/*. This equation determines « from (NV) 
and 7. Then 


U = 3kT<N> 65/2(4) ; ~ (1.73) 
C3/2(a) 


The behavior of the specific heat of a Bose-Einstein gas around T, is rather 
similar to the behavior of liquid helium near the so-called “lambda point” 
(A point). See Fig. 1.7.* Fora given p, T, and A are very close to one another 
(the lambda point is a few degrees K). Because of the mutual forces between 
particles, liquid helium is certainly not an ideal Bose gas, but perhaps part of the 
explanation of the lambda transition involves Bose condensation. 


* For a calculation of the discontinuity in (0C,/d7), at the transition point, see 
L. D. Landau & E. M. Lifshitz, Statistical Physics, problem of §59, Pergamon Press, 
1959. | 
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A more exact description of the behavior of liquid helium near the lambda 
point is given by the empirical formula: 


C pas a+ bin|T — T,l, ID <= Ty, 
"“ )a'+bin|T—-T), . T> Th. 


The explanation of this behavior is left as an exercise for the reader. If successful, 
publish! 


1.10 THE FERMI-DIRAC GAS 
For a Fermi gas, we proceed exactly as for the Bose case up to the point 


e fo — y e bni(e1—2) > e7 Bna(ez-W) (1.74) 


ny na 
Here n, = 0,1; 7, = 0,1; and soon. Thus 


eM = TY (1 + nem) (1.75) 


fee a Yin (1 + en Mermmy, (1.76) 
Once again, we can approximate the sum by an integral. This time there is no 
danger that a sizable fraction of the particles will be in the lowest state. Assume 
that the gas is of electrons, so that s = 2. 


1 <cnleh amie & d*p ; 
= —(_-) | In(1 + e hem) = py, 1.77 
g () | ( ) onhy3 (1.77) 
e B(p?/2m) Bu 3p 
et eee, a 
V V dp 1 +e FP7/2mobu (22 H)3 
e” Blea— ) 
Cie eee, (1.79) 


de, 1 + e  Flea—H) eba-w) 4 4 


The form of <n,) for T = 0 and for any T > 0 is shown in Fig. 1.8. Now at 
T = 0, (n,) = life, < pw; and (n,) = Oife, > vw. Let = wo at T= 0. In 
other words, at T = 0 all the states with energy less than yy are occupied, and 
all with energy greater than py are empty. Lo is called the Fermi energy and Uo/k 
the Fermi temperature. 


_ [?°, ( 4np? _ 4npo\ 1 
p= { 2 we dp 2( 3 lee , (1.80) 
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na T=0 Na T>0 


Fig. 1.8 Occupation of states at T = 0 and T > 0. 


iG) 


= p?/2m 
iy e= p*/ 


Fig. 1.9 The density of states as a function of energy. 


where flo = pol2m. For most “reasonable” values of > and T, po > kT (for 
example, uo/k for copper is 82,000°K*). 
Since 
2 3/2 
7 4np* dp ms 4x(2m) Ss de 
(2xh)° (2xh)° 
the density of states with respect to energy is 


4n(2m)>!2 ve 


IO) = ~ ony 


This function is shown in Fig. 1.9. 
2 — B(p?/2m—p) 3 
VeOV \ Op 2m] 1 + e Ar*/2m—H) (27h) 


* Also, in states of very condensed matter, as in a neutron star, “/k of the neutron, 
proton, and electron liquids are ~ 101° — 101!°K whereas T ~ 10°°K. 
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At T = 0, 
2 [Pip Anp* dp _ 47 1 ae 
u(0) = { Sues ee m2 |. p in) 
An 1\1 =, 
~ Ont (sn) cae ie 
ole 
u(0) _ L@2xh)> \2m/) 5 °° = 5 (28) = tu (1.83) 
p An 2 3 5\2m 
iE ct 


We will now tackle the case of finite 7. For T 4 0, what is » and what is the 
specific heat, etc.? 


e B(p?/2m) obu 3 
p = Sh eer ee eae ae (1.78) 
V on 1 + e p?/2m ob (D7 D3 
2 — B(p?/2m) ,Bu 3 
(p*/2m)e "iP eme™ 1) al i e’¥ 2d-p (1.81) 
1 + e7 bP*l/2mebu ae 


For convenience, we wish to express these integrals in terms of the energy e: 
e = p’/2m, J2me = p, dp = V/m/2 de. 
d?p becomes 4p? dp de = 2n(2m)?/?x/e de. Thus, 


p= (eT ee Vide =o [ee (1.84) 


0 &&-) 41 (2nh)? 9 PCH 41 
and 

00 3/2 00 3/2 

u= eer asl s/2de=a eee , (1.85) 

> eFe-H 4+ 1° (22h) 9 chew + 1 

where 
_ 4n(2m)?” 
(2nh)> — 
In both cases, the integral that must be evaluated is 
pu | gerd 
- » elem) 4 1’ 


where g(e) = ave for p, and g(e) = ae*/? for uy. Then 
©  g(e) de r~ g(e) de ‘i 4 g(e) de 
= — a s)de — | ———~—__.. 
: | ehe-H) 4 4 - eFe-H) 4 4 ‘ I ) e Fe-wH) 4 { 
In the first term let x = B(e — yp), and in the third term let x = — P(e — p). 


_t ” g(u + x/B) dx (* g(u — x/B) dx 4 ge 
r= ["a@de+ | eet Sp \ Fat 8 (1.86) 
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For what we have called “‘reasonable” values of p and 7, not much error will 
arise if we replace g(u + x/B) by g(u) + (x/B)g’(u) and [§* by [?. Then 


u 2 ° x dx 2 n7g'(u) 
Iz é)de+—g’ = &) de + ———.. 1.87 
[, 9@ Ped ee [9 oe. asp 
(The integral 
° x dx 
0 e~ + 1 


can be evaluated by expanding xe */(1 + e ~)in a power series, then integrating 
and summing; the result is 27/12.) So, 


7 m2 
I = | g(s) de + eT *9'W. (1.88) 
0 


Remember that for J = p, g(e) = ave, and for J = u, g(e) = ae?! 


p=a [Maes GOS 
Ll 


2 
= (3) w3/2 4 = ery (1.89) 
3 Vn 
But we have also p = (2a/3)u3’? from Eq. (1.80). Thus 
tok Ee 
wr u(t] (1.90) 


2 
5 
2 5/2 5/2 m0? (kT)? an? re 
~w-—a —a — —— + —— (kT 
5 Ho Ho Dy 2 4 (kT) V Ho 


an a. 2 = 
= Uy + —— V Mo (KT)? =up + 7T?;) y= “Vito kK’, (1.91) 


u= Uy + 7T?; U=w =U,+ yVT* =U, + y'T? 


oU 
Cy = — = 2y'T, 
V oT Y 


Cy of a metal = 2y’T + aT? atlowT 


elect. lattice 
vibrations 
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At very low T the y’T term dominates, and the contribution of the electron 
gas to the specific heat is noticeable. 

C, of ametal = 2y’T + 3Rat high T. Since the part of the specific heat due 
to lattice vibration is constant, the y’T term is again detectable. 

Note that at very high 7, we have to include higher terms in the expansion 
of g(e), and the part of the specific heat due to the electrons is no longer propor- 
tional to T. 

We can demonstrate more easily than we have done above that the internal 
energy is proportional to 7”. At any temperature less than the Fermi temperature 
almost all the states with energy less than the Fermi energy are filled, and almost 
all states with energy greater than the Fermi energy are empty. As the electron 
gas is raised to temperature 7 from T = 0, the average energy than can be im- 
parted to an electron is about kT. 

Also, only those electrons within about kT of the Fermi level will be excited, 
because electrons with less energy have no place to go—the states are filled. The 
number of electrons with energies between py and Uy — KT is 


ito 
N= | ase!!? de. 
Ho—kT 


For tg > kT, N is proportional to kT. Thus the internal energy added is pro- 
portional to (kT)kT = k*T?. 


CHAPTER 2 


DENSITY MATRICES 


2.1 INTRODUCTION TO DENSITY MATRICES 


When we solve a quantum-mechanical problem, what we really do is divide the 
universe into two parts—the system in which we are interested and the rest of the 
universe. We then usually act as if the system in which we are interested com- 
prised the entire universe. To motivate the use of density matrices, let us see 
what happens when we include the part of the universe outside the system. 

Let x describe the coordinates of the system, and let y describe the rest of 
the universe. Let g,(x) be a complete set of wave functions. The most general 
wave function can be written 


W(x, y) = py Ci y)9A(x). (2.1) 


At this point we will convert to Dirac notation. 
Let |g;) be a complete set of vectors in the vector space describing the 
system, and let |0;) be a complete set for the rest of the universe. 


p(x) = <xl9i) and — Oy) = ¢y19;). 
The most general wave function can be written 
> = 2, Cisl9ir0;> (2.2) 
W(x, y) = Cylxly> = 2, Ci<xlPir< y1O;> 
We can obtain Eq. (2.1) by taking 
Cy) = 2, Cj 318). 
Now let A be an operator that acts only on the system; that is to say, A does 


not act on the |#,;>. When A acts on product states (for example, |y/)) we really 
mean A|a>|b> = (Ala))|b). In such a case A does not equal 


2 Av lo ><orl, 
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but equals 
2 Aivloid!0,><8jKXorl. 
Then 
(A> = CWIAlY> = » Ci Cr XO j\(@i1Al@210;> 
ij’ 
= d CC: <91AlQy> 
= )' <eil Ali) p7i (2.3) 
where 
pix = Y, CHC;,-; = density matrix. (2.4) 
j F 


We define the operator p to be such that p;; = <9;|p|9;>. p operates only on 
the system described by x. 


WIAly> = » <oi|A pa lpr ><erlplen 
= )' <gilAplo;> = Tr pA (2.5) 
Where we have used the result 


Y |9><g| = 1 (by completeness arguments). 


From Eq. (2.4), it is obvious that p is hermitian. Therefore it can be diagonalized 
with a complete orthonormal set of eigenvectors |i) and real eigenvalues w,, 


p = wilid<il. (2.6) 
If we let A be 1, we obtain | 
dws = Trp = (AD = WI) = 1. (2.7) 
If we let A be |i’)<i’| we have 
w; = Tr pA = (A) = (WIAD = 2 (CHIE MCOAEYD) 


= VKH DIWI?. (2.8) 
Therefore, 
w,>0 ~~ and 2» w, = 1. (2.9) 


We now consider the concept of a density matrix independent of the pre- 
ceding motivation. First let us reformulate quantum mechanics: 
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Any system is described by a density matrix p, where p is of the form 
Di wilé><i| and 
a) the set |i) is a complete orthonormal set of vectors. 
b) w; => 0. 
C) >; W; = 1. 
d) Given an operator A, the expectation of A is given by 
<A) = Tr pA. 
Notice that 
<A) = Tr pA = Y<i'[pAli’> = Y wiXi'lid<iAli’> 
= ¥' wil Ali. (2.10) 
Since <i|A|i) = the expectation value of A in the state |i), it is obvious from 
(b), (c) and Eq. (2.10) that we can interpret the w; as the probability that the 
system is in state i. If all but one of the w; are zero, we say that the system is in a 
pure state; otherwise it is in a mixed state. It is easy to show that a necessary and 


sufficient condition for a pure state is p = p*. If a system is in a pure state, 
lénure» WE Can express the matrix as 


p= iene) <touiels 
Pij = <@ilplpj> ae <@ilipure> <tpurel Pj 


= (Qilipure((@ jlipure)* (2.11) 
More generally, 


Pi; = d wiXQiIk><@Ik>*. (2.11a) 
If it is possible to discuss the system in the x-representation, we can write 
p(x’, x) "=" <x'Iplx> = Y wi<x'li><ilx> 
= x w,i(x’)i*(x), (2.12) 
which, for a pure state |i), becomes 


p(x’, x) = i(x’)i*(x). (2.13) 


In the x-representation we write 


<A> = Tr pA = | ax<xlpdla. 


42 Density matrices 


But 
Cpa ele (| ax'’ <2) Alx) 


- { dx'<x\p|x"><x'lAlx) = | dx’ p(x, x")A(x’, 2). 


<A> = [a dx' p(x, x’)A(x’, x). (2.14) 


Considering again the problem of a system plus the rest of the universe, we can 
easily show 


(A) = | W(x’, ACR’, Ge, y) dx dx! dy. 
We see that in this case 


joy = | Wx, YW’, y) dy. (2.15) 


From our example in which we split the universe into two parts, we see that 
pure states are not general enough to describe a quantum mechanical system that 
does not include the whole universe. It is unknown whether or not the universe 
is in a pure state. To reformulate quantum mechanics in terms of the more 
general density matrices, it would be appropriate next to find the equation of 
motion of p. But first, as a simple example of a density matrix, let us try to 
describe polarized and unpolarized light. 

Consider a beam of light travelling in the z-direction. First define 


(0) Wave function for the x-polarized state 
(2.16) 


(°) Wave function for the y-polarized state 


Any pure state can be written as a linear combination of the two states in Eq. 


(2.16): 
(5) ou (;) +b (1): (2.17) 
b 0 1 


lal? + |b]? = 1. 
When we use Eq. (2.11), the density matrix for the pure state Eq. (2.17) becomes 


es a i (2.18) 


where 
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In using Eq. (2.11), we took 


| 0 
o> = (6) and 2) =(1), 


Now to examine four pure states and the density matrices corresponding to 
them: 


The x-polarized state: Leta = 1 andb = Oin Eq. (2.17). Equation (2.18) gives 


1 0 
= ; 2.19 
Px-pol. ‘ 7 ( ) 
The y-polarized state: Let a = 0 and db = 1. 
0 0 
= 2.20 
Py-pol. (; ; ( ) 
The 45°-polarized state: a = 1/x/ 2andb = 1 If 2 gives 
P4s°-pol. = (; ‘). (2.21) 
The 135°-polarized state: a = —1 I 2andb = 1 I 2 gives 
P135°-pol. = ( 2 7 . (2.22) 
-t 4 


The p’s in the last four equations are for pure states. Now consider the following 
two mixed states: 


1. Mixture of 50% x-polarized and 50% y-polarized states: From Eq. (2.11a) 
the density matrix for this mixture is 


p= $Px-pol. + $Py-p01. 7 (5 2 (2.23) 


0 4) 
2. Mixture of 50% 45°-polarized and 50%, 135°-polarized states: The density 
matrix for this mixture is 


0 
P = 4Pas-por. + $P135-pol. = t : . 
Thus these two mixtures have the same density matrix and show the same phys- 
ical effect. Note that a given pure state (for example, x-polarization) determines 


a state vector (wave function) only up to a phase factor, whereas the density 
matrix is determined uniquely. 


(2.24) 
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2.2 ADDITIONAL PROPERTIES OF THE DENSITY MATRIX 


Recall that the density operator can be written 
p = Ywili><il, (2.25) 


where the system is in state |i) with probability w;. As time changes, the possible 
states of the system also change; so 


p(t) = ¥ wili(t)><i@)]. (2.26) 
It is easy to find how |i(t)) changes in time, for we can expand |i(0)) in eigen- 


states of the Hamiltonian, H, and we know how those eigenstates change in time. 
Let H|E,) = E,I|E,)- 


li()> = LE, ><E,|i(0)>. (2.27) 
It follows that 
li(t)> = ) IE,>e'*""E,|i(0)>. 


For convenience we use units in which # = 1. If we define f(A) for any function 
f by the equation f(H)|E,) = S(E,)|E,>, we can write 


(D> = Ye ME, ><E,liO)> = e ™[i(0)). (2.28) 


Notice that, according to our definition of f(/), if f(Z,) can be expanded in a 
power series for any eigenvalue of H, then (#) can be expanded in the same 
power series. 

Substituting Eq. (2.28) into Eq. (2.26), we get 


ple) = Y, we ™1i(0)><i(O}le™ = e—*p(O)e™ (2.29) 


Taking the derivative of p with respect to time, we get 
p = — i(Hp — pA). (2.30) 


You may be suspicious about treating H like a number when taking the 
derivative of terms such as e ‘#‘, but such a procedure can be justified by 
expanding e~ ‘#* in a power series. Alternatively, we can obtain Eq. (2.30) from 
Eq. (2.29) by taking the derivative of (E,|p(t)|E,,». In any case, Eq. (2.30) is 
true, and it plays the same role for density matrices that Schrédinger’s equation 
plays for wave functions. It should be noticed that an observable A in the 
Heisenberg representation obeys 


A = +i(HA — AR). (2.31) 
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The signs of Eq. (2.30) and Eq. (2.31) are opposite. We could put Planck’s 
constant back into the equations by replacing every H by H/h. From Eq. (2.29) 
we obtain 
Tr p(t) = Tr [e~*(p"(O)e™)] = Tr [(o"(O)e™)e "7 
= Tr p"(0) (2.32) 
where we use the fact that Tr AB = Tr BA. It follows that Tr p” is constant in 


time, and consequently Tr /(p) for any function fis constant. This result can 
also be obtained by noticing that 


p” = Ywili<il, 
SO 


flo) = Y fowDlid<il. (2.33) 


Tr f(p) = X; f(w,), and the w,’s are constant. [being the eigenvalues of p and 
thus unchanged by a unitary transformation such as Eq. (2.29)]. 
It is possible to define “‘entropy” by the equation 


a i —— 2; W; In W;- (2.34) 


A pure state has “S’? = 0, whereas very impure states have large positive 
“entropy.” The quotation marks are used because, by our definition, the ‘‘en- 
tropy” cannot increase in time but must remain constant. 

Next let us try to get a bit more experience in dealing with density matrices. 
In ordinary quantum mechanics it was easy to find such things as the expectation 
value of the position, the expectation value of momentum, and the probability 
that a system is in a given state. 


(a) Expectation value of position: According to Eq. (2.14), if A is an operator, 
<A> = | dx dx’ p(x, x')A(x’, x). 


In this case A = x, and 


A(x’, x) = <x'|A|x) = x<x'|x) = x d(x’ — x), (2.35) 
SO | 


CYS [a xp(x, x). (2.36) 


(b) Expectation value of momentum: Now we must take A = p. Since 


ixi ply =" 2% gx, 
i Ox 
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we have 
! h 0 ’ 
A(x’, x) = — — d(x’ — x). (2.37) 
i Ox 


Integrate by parts; for any function f we have 


[700 3. ace — 2) ax - | 8” — 9 fo) dx = -F'@) 
Ox Ox 


<p> 


-| as dx’ a d(x’ — x) Z p(x, x’) 
i Ox 


eee | dx E (x, »)| (2.38) 
l Ox x’=x 


(c) Probability that a system is in state |y): Finding the probability that a 
system is found in the state |v» is equivalent to finding the expectation value 
of the operator |v)<y|. To see this for the case of a pure state, notice that if 
the system is in a state |W), the probability of experimentally finding it in 
state |) is 


KAW? = Wad = WI GdIy?. 


Therefore, the probability that the system is found in state |y) can be written 


P = probability = Tr plxy><x| = Tr } wilid<ilx><xl 
= Ywil<ilol? = <xlelo. (2.39) 


The equation 
P = Y wil? 
is in accord with our interpretation of the w; as the probability that the 
system actually is in state |Z). 
The probability density at value x, for a coordinate is the probability 
that the system is in state |x)), which by Eq. (2.39) is p(x, xo). For a pure 
state |W), p(Xo, Xo) = W(x)”. 


2.3 DENSITY MATRIX IN STATISTICAL MECHANICS 


When |9;) is an eigenket (eigenfunction), and EZ; is the corresponding eigenvalue 
of the Hamiltonian H of the system, the probability that the system is in the 
state |~;) is 

(1/O)e~ FF, (2.40) 


2.3 Density matrix in statistical mechanics 47 


Thus the density matrix is 
p = Si wyl@,><Gql, where w, = = e~PE, (2.41) 


Alternatively, the coordinate representation is 


! 1 — BE; ’ 
p(x, x‘) = ri PPD (x) 97 (x'). (2.42) 
Because H|g,) = E,|9,>, we can write Eq. (2.41) as 
p = 1S eMlg,d<e,| = —— (2.43) 
Q‘r “en 
where 
eFF=a=Q=) ¢%*=Tre ™, (2.44) 
SO 
eas 2.45 
OS Te eo OH a 
The average energy of the system, U, can be written 
Tr [He **] 
U = Tr pH = ——_—.. 2.46 
p Tr [e-P] (2.46) 


When we know the free energy and the energy of the system, the entropy is 
derived from 


F=U-TS. (2.47) 


It can be shown that, if Hp is any other Hamiltonian, 


F < (A), — TS), (2.48) 
where 
_ Tr [He ?"°] 
(Ho = Foti (2.49) 


and Sp is the entropy calculated from Ho. Fin Eq. (2.48) is the true free energy 
corresponding to the true Hamiltonian H, as written in Eq. (2.44). When we 
write the free energy corresponding to Hy as 


Fy = (Ho)o — TSo (2.50) 
and subtract Eq. (2.50) from Eq. (2.48), we may write 


Tr [(H — Hy)e~8"°] 


F< Foy + 
: Tr [e7 9?°] 


(2.51) 
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This theorem will be proved later, in Section 2.11. 
Now, we regard the density matrix as a function of B: 


p(B) = e7 FF /Tr e7 FF, (2.42) 
The unnormalized p is defined by 
Pu(B) = eM. (2.52) 


In place of py we will hereafter write p. Next we will show that p(B) obeys the 
differential equation 


—0Op/0p = Ho. (2.53) 
The proof is as follows. In the energy representation, Eq. (2.52) can be written as 
Piz = Oye *™, (2.54) 

Differentiating this equation, we have 
— 6p;;/O8 = 5B °" = E,p;;, (2.55) 


from which we obtain Eq. (2.53). 
The initial condition for Eq. (2.53) is 


p(0O) = 1. (2.56) 
We can write Eq. (2.53) in the position representation as follows: 
— Op(xx'; BOB = H,p(xx'; B). (2.57) 


Here the subscript x on H,, indicates that H, operates on x in p(xx’; 8). The 
initial condition is 
p(xx’; 0) = d(x — x’). (2.58) 


2.4 DENSITY MATRIX FOR A ONE-DIMENSIONAL FREE PARTICLE 


We want to solve the differential equation discussed in Section 2.3 for a one- 
dimensional free particle. The Hamiltonian is 


H = p*/2m, (2.59) 
and the differential equation (from Eq. (3.57)) is 


=, ; —h? 0? 
ar = 5 55 ale x's 8) (2.60) 
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This is a diffusion-type equation, and we can write down the solution readily: 


p(x, x’; B) = F a exp | (sia) (x — | ; (2.61)* 


The numerical factor is chosen such that 


p(xx'; 0) = d(x — x’). (2.62) 
For a linear system of length L, the trace of Eq. (2.61) leads to 
kT 
“BF ,x) dx = 1m | 2.63 
| ptx, x) dx = 1 / AT (2.63) 


This is the partition function for the system. 


Problem: Show that the partition function for a three-dimensional system with 
N particles is 


e FF — VM(./mkT /2nh?)>". (2.64) 
Another way of deriving Eq. (2.61) is as follows: 
p(x, x"; B) = Ye Pm (x) Uin(*’). (2.65) 


If the particle is in a large box of volume V, we take 


— Vd *P = — we ip:x/h 
FY aay tt Yul) Hale) = eae 


Then Eq. (3.65) becomes 


p(x, x: B) = d*p e7 bp? /2moip/h: (x— x’) 
(2nh)° 


m 3/2 eaaces 
- (; 7) ~~ (2.66) 
TU 


as in Eq. (2.61). 


25 LINEAR HARMONIC OSCILLATOR 
The Hamiltonian for a linear harmonic oscillator is 
H = p*/2m + mw?x?/2, (2.67) 


* Note that for x,x’ constant, no expansion in powers of B exists near 6 = 0. This is, 


indeed, an important feature of p,_., and will influence the construction of perturbation 
series. 
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so that the differential equation for p is 


—dp — fi? 02 ma?” 2 
—_ = — — p + — : 2.68 
op 2 m Ox? 2 oe 28) 
Let us write 
mo ho hw 
re eae, =p oe os 2.69 
h j 2 P 2kT ( ) 


which simplifies Eq. (2.68) to 


— Oop —@p 2 
— = + 2.70 
if 3 c*p (2.70) 
The initial condition is 
p=0O(x-—x’) for f=0, (2.71) 
or 
p= |m — 6(€ — for f= 0. (2.72) 


The factor comes from the scale change in the 6-function. In general, 6(x — x9) = 
IF’ (IO f(x) — £(xo)] for f monotonic. 

For low f (high temperature) the particle should act almost like a free 
particle, as its probable kinetic energy is so high. Therefore we expect that, for 
low f, the density matrix for a harmonic oscillator will be given approximately by 


oé, €', f) x Vmal(4nhf ye E> 27/45, (2.73)* 
Knowing the Gaussian dependence of p;,. on €, we try the following form: 
= exp {—[a(/)é* + d(f)E + c(/)]}- (2.74) 
Use of Eq. (2.74) in Eq. (2.70) leads us to the equation 

af4+ bh’ +c’ = (1 — 4a’)&? — 4abé + 2a — Bb’, (2.75) 

where a prime (’) means a derivative with respect tof’ From Eq. (2.75), 
a’ = 1 — 4a’, (2.76a) 
b’ = —4ab, (2.76b) 
c' = 2a — b’. (2.76c) 


* This is only true for € #4 &’. When they are equal, p(é, ¢’, f) is equal to p,.., only 
for f = 0. See Section 3.10 for details. Obviously, when 7 is large and the classical 
motion has large amplitude, the larger the amplitude the less will the oscillator feel a 
force between fixed x and x’. 


2.5 Linear harmonic oscillator 51 


Equation (2.76a) is integrated to give 
a = (2) coth 2(f — fo); 
where the initial condition Eq. (2.73) requires that the integration constant f, 
vanish: 
a = (4) coth 27. (2.77) 

Equations (2.76b and c) are integrated as 

b = A/sinh 2f. (2.78) 

c = (4) In (sinh 2f) + (A?/2) coth 2f — In B, 


where A and B are constants. Use Eqs. (2.77) and (2.78) to get 


B e Aé A? 
p= exp} —|— coth 2f + — + — coth x). (2.79) 
sinh 2f € sinh 2f 2 
When we let f > 0, Eq. (2.79) gives 
2 2 
esis lees sae . (2.80) 
V2f af 


For this result to agree with Eq. (2.73) we see that 


Mm 


A= —€', B= ;/|—. 2.81 
‘4 ae (2.81) 
Using Eq. (2.81) in Eq. (2.79), we obtain the final form: 
; mo 
x,x': pf) = _ |——————_ 
pdx, 2° B) ey 
—-mMm@ 2 12 ’ 
x exp {——_—_ + h2f— 2 2. 
p im sinh 9 t x’*) cosh 2f xX |} (2.82) 
or 
; mo 
x, x'; 8B) =) | ———____—_— 
pt, x5 B) J Qnh sinh (ho/kT) 
— mo 2 a hw ; 
x exp ¢—______—_ + h—- — 2 . (2.83 
i sinh (hao/kT) c Oe |} ce) 


The special case when x = x’ gives 


2 mMm@ —-m@ 2 
9 9 = Nise oS a ee <r t h . ° 
p(x, x; B) ’| on sinh Of exp ( ; x* tanh f (2.84) 
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This is the probability for finding the system at x. Notice that it is of Gaussian 
form. 
Equation (2.84) is used to calculate 


(x2) = Leto ds Fk cop = * coth 2% (2.85) 
{ p(x, x) dx 2m@ 2ma 2kT 


The average of the potential energy is thus 
{Potential energy) = (4)mw?{x*) = (fa/4) coth f. (2.86) 


We may compare this result with the average of total energy calculated by means 
of the partition function. We know 


hoe *?"T falt+e tha 


hw 
Ener = — + —____ = — — coth 2.87 
pny? 2  1-e fT 24-e°%7F 2 a 


Equation (2.86) is exactly half of Eq. (2.87). Therefore we know that 
‘Kinetic energy) = (Potential energy) = (%@/4) coth f. (2.88) 
The partition function is derived from Eq. (2.84): 


e~ FF — | o(x, x) dx = = = sates (2.89) 
2rh sinh 2fV m@tanhf 2 sinhf 


which leads to 
= “m (2 sinh f) = kT In (e%@/2*7 — @~fe/2kT) 


-= + kT In (1 — e7 Folk), (2.90) 


This is the free energy, as already derived. 
We may examine the limit of high temperature, or small fin Eq. (2.84). In 


this limit 
—ma?x?/2 —V(x) 
e oo — 5 2.91 
p(x, xX, B) — exp ( kT ) exp ( kT ( ) 


except for the factor in front of the exponential. This result, Eq. (2.91), agrees 


with classical mechanics. 
The low-temperature limit is found by inspection of Eq. (2.83): 


_ 2 _ 2,12 
p(x, x’) > ex ota op (Fr) exp ( ra exp ome, (2.92) 


This limit is calculated as 
p(x, x’) = Ye Mo (x)oF(x') > eG o(x) Pox’), (2.93) 


2.6 Anharmonic oscillator 53 


because when B — oo, only the ground state has an effective contribution. 
Equation (2.92) agrees with Eq. (2.93) because we know that for the ground state 


MN@ He — mo@/2h)x2 
Po(x) = 3, a aa (2.94) 
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Consider the potential energy shown by the curve in Fig. 2.1, given by the follow- 
ing function: 


2 
V(x) = oF x? + kx, (2.95) 


We will consider the area near x = 0 only, so that the region V(x) < 0 does not 
come into the calculation. 

Because of the anharmonicity, when the temperature increases the mean 
position of the oscillation moves out, as shown in Fig. 2.2. Bearing this in mind, 


Fig. 2.2 Mean position of the oscillation is displaced. 
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we will treat the problem using the minimum principle discussed previously. 
The principle is written as 


F < Fo + (H = Hoyo: (2.96) 
Here F is the true free energy and A is the true Hamiltonian. When 4) is any 
Hamiltonian, (), means the average taken in a system characterized by Hy. Fo 
is the free energy of the system of Ho. 
In the present problem we take for H, 
Ho = p*/2m + (mw?/2)(x — a)’, (2.97) 
which describes a harmonic oscillator with its center displaced by an amount a. 
The @ in Eq. (2.97) is the same as that in Eq. (2.95). From Eq. (2.90) we find that 
Fy = kT In (2 sinh (Aw/2kT)). (2.98) 
Now we have only to find 
2 2 
C= HS. = x? + kx? — a (x — a) (2.99) 
0 


If we make the substitution y = x — a, Eq. (2.99) becomes 
2 mo”, 3 2 2 3 
<H — Ho>o = Le ea + ky~ + 3ky*a + 3kya“ + ka~). 


0 


(2.100) 


Because Hy = p?/2m + (mw*/2)y”, we can easily calculate the expectation 
values of the various powers of y. By symmetry, 


CYyo = {Yo = 0. (2.101) 
From Eq. (2.85), we obtain 
<y*>o = (h/2mo) coth (ha/2kT). (2.102) 
Substituting Eq. (2.102) in Eq. (2.100) yields 
F < Fy + (H — Hoyo = Fo + (mw?/2)a” + ka’ + 3kaCy*)o. (2.103) 


The best estimate of F from Eq. (2.103) is obtained by minimizing the right- 
hand side. In so doing we neglect the term ka® because we are interested only in 
small oscillations, so that V(x) < 0 does not occur. Then the differentiation of 
Eq. (2.103) with respect to a leads to 


ma’a + 3k<y*)> = 0, (2.104) 
that is, 


—3k 
a= 2 Cy 0 = 
mo 


_ 3kh coth (o/2kT) 


2.105 
2m7a° ( ) 
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In this case, Eq. (2.103) becomes 
F < Fy — (ma?*/2)a’. (2.106) 


In Eq. (2.105), the displacement of the mean position, a, has a simple interpreta- 
tion. It is the point at which the average force on the oscillator vanishes. This 
result is seen as follows: from Eq. (2.95), 


Force = mw?x + 3kx? = mw*(y + a) + 3ky? +°:: (2.107) 


<Force) = mw?a + 3k¢y*) +:°°. (2.108) 


Therefore, when we neglect the higher powers of a, we arrive at Eq. (2.104). 


Problem: Consider a general pounding potential V(x) like that shown in Fig. 
2.3. The Hamiltonian is 


H = p?/2m + V(x). (2.109) 


In order to apply the variation principle in Eq. (2.96), try the following H,: 


2 2 
Pp MoWo 2 
Hy = —— + — (x - : 2.1 
oa, 5 ( a) (2.110) 


and vary a, Mo, and @,. Show that 


i) M =m; (2.111) 
11) @p and a are determined from 
<V'(x)>o = 0, (2.112) 
(xV'(x))o = (p?/m)o. (2.113) 
V(x): 


Fig. 2.3 A general bounding potential. 
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Here <), is taken for Hy. Equation (2.112) comes from the consideration of the 
absolute location of the system, and Eq. (2.113) comes from the scale of this 
variable. The latter corresponds to the virial theorem, which states that 


(xV"(x)) = <p*/m). (2.114) 


To show how Eq. (2.114) is related to considerations of the scale of the 
variable, let us derive the virial theorem in a more general form. 

Consider a system inside a cubical box with sides of length L. The Hamil- 
tonian is taken to be 


2 
H,=Y. — + ¥ V(r). (2.115) 
pee Ge 
(We assume that the motion of the system can be described by a potential 
depending only on the distances separating pairs of the particles making up the 

system.) rj; = |r; — ryl- 
If the system is bound, then for a sufficiently large box the pressure is 
essentially zero. But in general, 


cL Anarene Lil (2.116) 
0 (Vol) 3? OL 
Let F,, = F for a box of length L. When we later prove the theorem that 
F < Fy + (H — Ho)o, we will in fact prove that (H — Ho), is the first-order 
correction in the perturbation expansion for F. We get an expression for P by 
writing 


P = Pressure = — 


Fici+e) © Fy + correction term 
or 
Figi+e) © Fr + (Aas. — Arda,- (2.117) 


Now Ax; +2) differs from H;, only in that (r;), can run from 0 to L(1 + «) rather 
than 0 to L, and similarly for (r;), and (r;),. Therefore let us make a change of 


variables, such that ri = r,/(1 + ¢). As p? = —h’V?, we obtain 
12 
Di , 
on 2 2m{1 + &)* » L i] 
w<J 


2 12 
Dy ! Di ’ ’ 
w VP 4 SY veri) +e| 2 + YriVi'p| (2.118) 
» 2m; 4 Ow) 2 2m 4 ; ‘ 
i<j i<J 

Since the new coordinates (primed) run over the same range as the original 
(unprimed) ones in Eq. (2.115), we can eliminate the primes over the r;; and P; 
and write 


CAyi+e — Adu, = | -2 <kinetic energy> + ( rv (ra) ; (2.119) 


2 | 
i<j 
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From Eq. (2.116) and Eq. (2.117), 


spy = —L OF Ly Ferg — Fe ta. (2.120) 
OL Le E 


From Eq. (2.119), 


3PV = 2 (KE) — ( rv). (2.121) 


i<j 
For the case of a harmonic oscillator, P is zero and we get 


(xV'(x)) = p?/m). (2.122) 


This result should be compared with Eq. (2.113). 
For the case of a particle moving in an inverse-square force field, we get the 
classical result 


(KE) = —(4)<potential energy). (2.123) 


This method can be generalized. Let H be the Hamiltonian for the original 
system and let H, be the Hamiltonian with the box changed in an infinitesimal 
manner. Suppose Fy, = Fy, and suppose that H, — H = eS. Then the above 
method gives (S), = 0. For example, in Eq. (2.118) we have the case for which 


2 
Pj ’ 
i<j 


As another example, suppose that the box undergoes an infinitesimal transla- 
tion, ¢, in the x direction. We get, by taking x’ = x — g, 


D;" | AV 
H,~ Yi + VV) + ey — (ri), (2.124) 
i 2m; ip ij Ox 
j i<j 
so that 
sa 26) (2.125) 
‘ij. OX 
L<J 
and 
yo) (2.126) 
‘tj, OX 
i<j 


Compare Eq. (2.126) with Eq. (2.112). 
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2.7 WIGNER’S FUNCTION 


The density matrix may be written in the coordinate representation and in the 
momentum representation as follows: 


p(x, x") = Ye Fp (xp Xo’), (2.127) 


p(P, p') = Ye “of p)gF(p’) 


= | oC: x’ ye UMP XPD dy dx’, (2.128) 


The diagonal element 
p(x, x) = P(x) (2.129) 


represents the probability for finding the particle at x (neglecting the normaliza- 
tion constant 1/Tr p). Similarly 


p(p, p) = P(p) (2.130) 


is proportional to the probability for finding the particle at p in momentum 
space. 

The expression Eq. (2.130) is used in calculating the average value of the 
kinetic energy: 


( Pp } _ Tr [(p*/2m)p] _ J p(y, pi p?/2m)(ap/2nh) (2.131) 
2m Tr [p] § e(p, p)(dp/2zh) 


In classical mechanics, the density function f/(p, x) in phase space has the 
following property: 


P(p) = [70 x) dx, 


(2.132) 
P(x) = | fp, ») P. 


We ask ourselves if there is any function f(p, x) in quantum mechanics that 
satisfies Eq. (2.132). One answer is Wigner’s function f,(p, x): 


fw(p, x) = {o(: + = x —- 1) e~ pn) dy, (2.133) 


This function is derived by regarding p(x, x’) as a function of (x + x’)/2 and 
(x — x’)/2, and then writing (x + x’)/2 as x and making a Fourier transform 
with respect to (x — x’)/2. 


2.7 Wigner’s function 59 


Let us check that Eq. (2.133) satisfies Eq. (2.132). First, 


ap ea 4 a) q ge i(pn/h) dp d 
[oC 2 = {os + so Xf ¢ = n 


= {= + - x - 5) o(y) dn = p(x, x) = P(x). | (2.134) 


Thus the second part of Eq. (2.132) is satisfied. Next, using Eq. (2.128) and 
Eq. (2.130), we have 


P(p) = p(p, p) = | p(x, x’je~ MPE-*) dx dx’ 


= {o> + - y- ") eW'Pnl® dy dyn = | Jot y)dy, (2.135) 


where we have made the change of variables: 


x = yt n/2 
eal (2.136) 
x = y — n/2. 
From Eq. (2.132) we see that, if h(p, x) is either a function only of p or a function 
only of x, then 


Giles | fu ps x)HCp, x) SP. de (2.137) 
1th 


However, Eq. (2.137) is not true for a general function, A(p, x). 

Although Wigner’s function fy(p, x) satisfies Eq. (2.132), it cannot be 
regarded as the probability for finding the particle at the point x and with the 
momentum p, because f,,(p, x) can become negative for some values of p and x. 


Problem: Calculate Wigner’s function for a harmonic oscillator. This is a 
special case where f,(p, x) is always positive. 

In Eq. (2.133), we derived Wigner’s function from the density matrix. We 
can derive the latter from the former: 


e x—- xX d 4 
| Jol ».** a ay edie oe p(x, x’). (2.138) 


Thus fy,(p, x) and p(x, x’) contain the same information. 


Problem: Show that the momentum representation of the free-particle density 
matrix 1s 


p(p, p’) = 2nhd(p — p’je P/2m/KT’ (2.139) 
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2.8 SYMMETRIZED DENSITY MATRIX FOR N PARTICLES 


We will now discuss the density matrix for a system of many identical Bose or 
Fermi particles. 


The symbols will mean 
D Distinguishable 
S Symmetric 
A Antisymmetric 


First consider the case of free particles. The Hamiltonian is 


— fi 


; oe 
2m 


> Ve: (2.140) 


The density matrix pp satisfies the equation 
Opp/Op => — Hpp. (2.141) 


The solution is 


m_ \3N/2 74 ; 
poly Xap 3s Hi Xin h5 B= ( exp | — pag DO Xy) | 


2nh?B ; 
(2.142) 
In general when particles are interacting with each other we can write 
Pp(X15 X25 +--+ XN3 X4,X2,---Xn3 B) = a de eT PP iy (x1 +++ XN) WF (x4 ++ Xy)- 
(2.143) 


When the particles obey Bose statistics, then 


Ps(X-+- 3X4 --- XW BY=— Ye eM W(x xy WFO xy). (2.144) 


Symmetric 
states only 


We introduce the notation P to indicate the permutation of particles. For 
example, for a permutation 


we have 


When the state w; is symmetric, 


WAPX,) = Wir). 
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We can construct p, starting from pp. As an example consider the case of two 
particles: 


Po(X1X23 X4X4) = » eT PF (x1, X)Wi (x4 X9). 
all states 


Now interchange x; and x;: 


Py(X1X23 X4X4) = eo eo PFiy (x4 x2)WF (x54). 
ali States 


Take the average of the two: 
3 Pp(%1%23 X4X9) + Pp(X1X2; X2x4)] 


= Ye Bab xy x2) LVF 4x2) + WFOX1)] 


all states 


» e PP iy (x4 x2) Wi (x44), 


symmetric 
states only 


because 


0 antisymmetric wy} 
wit(xx5) symmetric y*. 


MWtGxix)) + WtCsx4)] = 


In general, a group-theoretical argument can show that 


W(X, °° X,°°*Xy) symmetric 
0 for any other symmetry. 


(2.145) 


wi MPa Pe Pay) = | 


Here >>p means that we sum over all permutations. When we use Eq. (2.145), 
the case of N particles can be written as 


Pr ine ee ey) 
1 / / / 
a et Rg ge EE) (2.146) 
Thus the partition function becomes 


e bFs m= | aster sexes a) ds da 


= wi | Pater se es Pease Pag) dx,:::dxy. (2.147) 
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Problem: Derive the partition function for a noninteracting Bose gas. Use 
Eq. (2.142) in Eq. (2.147) and first write 


ons _ 1 m 3N/2 
N!\2nh7p 


+ (xy — Pex) dx,+*:dxy. (2.148) 


In taking >°p, we start with the identity permutation that gives V% from the integ- 
ral and next go to a pairwise interchange of particles, three-particle cyclic 
permutation, and so on. The combinatorial factors must be taken into account. 


Solution: Any permutation can be broken into cycles. For example, 


123 4 5 6 1 5 6\/2 3\/4 
r-( {itd i) = asoran (1 l ( (i) 
53246 1 5 6 1/\3 2/\4 


Suppose a permutation contains C, cycles of length v, where >°, vC, = N. For 
v > I, let 


m 3v/2 
h, = a ae dx, dx, pee dx, 
2nh? Bp 


pbc i or [(x, — x2)? + (x2 — x3)? +°°° 


$y — a + | (2.149) 
3/2 3/2 
h,=(—"-) |ax, =v(—_| . 
2nh7 B 2nh? Bp 
Then Eq. (2.148) can be written 


pte as : (1 ny). (2.150) 


It is not difficult to show that 


fa exp [—a(x — y)*] exp [—b(y — z)’] 


7 nz \3/2 _ ab a ; 
-(5) exp | re |. (2.151) 
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Using Eq. (2.151) we obtain 


m 


3/2 | 
eng guanine ie 2.152 
eae) ( 


We must now count the possible permutations. Let M(C,,...C,) = the 
number of permutations that have 


C, cycles of length 1 
C, cycles of length 2 


C, cycles of length q 


Then 
= 1 
ge FFs > M(C,,..-,C,) [] (h,)~. (2.153) 
NS Circe Cc; v 
As a preliminary to the determination of M(C,,..., C,), consider the case of 


N = 6, C, = 3, C3; = 1, C, = Cy = Cs = Cg = 0. An example is the 
permutation P = (5)(3)(2)(461). Notice that the sequence Q = 5, 3, 2, 4, 6, 1 
corresponds to P. In other words, from the sequence Q, we can uniquely con- 
struct the permutation P with C, = 3, C; = 1. But from Q’ = 2,5,3,4,6,1, 
or from Q” = 5,3,2,1,4,6, we can also construct in the same way the same P. 
In fact, it can be easily seen that there are 3! times 3 = 18 different sequences 
that can be used in the way described above to construct the permutation 
P = (5)(3)(2)(461). 

Generalizing to any set of C’s, we see that there are two ways we can get a 
new Q that leads to the same P: 


a) we can interchange cycles of the same length, 


b) we can make cyclic permutation within a given cycle. 


There are [[, C,! ways of interchanging cycles of the same length, and [], v“ 
cyclic permutations within cycles. Thus for each P there are [], C,! [], v™ 
different Q’s that lead to the same P. Because there are N! possible Q’s, 


N! 
M(C1,--+4 Cd) = Fg: 
jae vO 


It follows that 


—BFs hy” 
eMs= yy [I= (2.154) 


>] 
C1,C2,...Cq v C,! yo 


where >, vC, = N. 
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Instead of trying to do the sum in Eq. (2.154), we will find the free energy 
of the grand canonical ensemble (that is, we will let N vary). 


erie \ ie tine (2.155) 
N=1 


As usual, let « = e*4“*. Then, 


= ~> Il yvcy = y Tl [A [A,(@/v) 


-Cq y Cc. Tae Cig 0. Cg a vy ©: ! 


where now each C, runs from 0 to oo. Interchanging [] and >, we have 


e-PF TT y bs oe = ~ (" *) = exp » h, ): 


y Cy=0 


Then, 


BF 


2nh? 


oe ha” m 3/2 Oo” 
— = es V yan’ (2.156) 


This result is identical to the one we obtained in a much simpler way earlier. 
When the density of the gas is not large, only the identity permutation is 
important for high temperatures or for the limit of 4 — 0, since we have such 
factors as e7 lmkT/2hy« — x)"1 This means that the quantum effect appears only 
for low temperatures. 
For the antisymmetric case, 


DAO OP Kye rt ye or ke xy) 
1 P / / / 
meer yy (-1) Drier Kye PX OPP, Oe Pky); 


where 
l)p = 1 for even permutation 
: —1 for odd permutation. 


Problem: Modify the argument leading to Eq. (2.156) so as to obtain the cor- 


responding result for a noninteracting Fermi gas. 


2.9 DENSITY SUBMATRIX 


For N particles, the density matrix can be written 


P(X; 5 XN> Xi X, oXs:.3 Xy) = x WiP(X,, ae ., Xy)QF (x4, cg Xn). (2.157) 
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Notice that we are not now specializing to the case of statistical mechanics. To 
compute the expectation value of P?/2m, we proceed as follows: 


2 2 
2m/ Trp | 2m 


1 h2 CQ A sad Ae 22g 
= Fey am | ae PR Iie RR] de da 
fy? 0? : _ P? 
= or dx ax? pP1(%1, X4) ag = Tr p, ay (2.158) 


where we define 
0,(x, x’) = J PC%, X25 +++ 5 Xs X's Xay ++ + Xn) AX2 11+ dy (2.159) 


Tr [p] 


Pp, is the one-particle density submatrix. Using this matrix we can calculate, for 
example, 


De\ _ he 0° = Pe 
(Z a = —N iE Axi p4(x, “| _ dx = NTr = P| (2.160) 


(assuming that the particles are essentially identical). 


Similarly, 
(Z vex)) = N | V(x)p,(x, x) dx = N Tr [Vp,]. (2.161) 


The two-particle density submatrix is defined as 


j P2(X1X2X3 °° X53 X4X2X3 °° Xy) dX3°** dXy 
Tr [p] 
From Eq. (2.162), the two-particle potential energy V(x,x,) has expectation 


( V (x;, “)) = ee | V (X1X2)P2(X1X25 XX2) dx, dx. (2.163) 


i<j 


PX 1X23 X4X2) = (2.162) 


Special cases of density submatrices are the distribution functions. The single- 
particle distribution function is defined as 


Pi(x) = py(x; x). (2.164) 
The two-particle distribution function is 
P2(X, Y) = P2(x, y; x, y). (2.165) 


In a uniform substance, such as a gas or a liquid, p,(x) is independent of x, and 
P2(x, y) depends only on the difference (x — y). 
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2.10 PERTURBATION EXPANSION OF THE DENSITY MATRIX 


Recall that the density matrix in statistical mechanics satisfies 


op 
-—- = —Hp. 2.166 
ry: p ( ) 


There are very few Hamiltonians for which we can solve Eq. (2.166) exactly, but 
it may be that H is close to one such Hamiltonian, Hp. 


H =H H, (2.167) 
Po = —Hopo- (2.168) 
op 


We would like to use pp to obtain an approximation to p, which we expect 
to be close to py = e~ 8#°, Because p is close to e~ °#°, we expect e#°%p to vary 
slowly with B. 3 


po ~ Het p ae ar = eH oF Hop = eH oP Hp = — eb Hp. 
(2.169) 


Integrating Eq. (2.169) from zero to B, and remembering that if 6B = 0, we find 


fiby = 1, 
eth (fg) — 1 = — | e!08'H o(B) ap (2.170) 

Therefore, 
p(B) = po(B) | po(B — BH, p(B") ap’ (2.171) 


The last term in Eq. (2.171) is small if H, is small, and it is a correction term to 
the approximate equation p & po. If the correction term is small, we can use an 
approximate p(f’) to obtain a much more accurate approximation to p(8). For 
example, if we make the approximation p(6’) © po(B’), we have 


p(B) = pol) — | ” po(B — BH pol 6") AB. (2.172) 
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In using Eq. (2.172) as a new approximation to p(f’), we can find a still better 
approximation to p. Continuing in this manner we get 


Air pes |, dB'Loo(B — BH .po(B)] 
0 
4 | ” ap | dB’ Lo B — BH sp0(B" — BH, p(B") 
¢) ‘¢) 


B B’ Bp” 
— | ap’ | ip" | dp’, je ce (2.173) 
0) 10) O 
We can easily rewrite Eq. (2.173) in coordinate representation. For example, 


p(x, x’; B) = <x\|p(B)|x’> 


See ee | ” <xlpo(B — B’) ( | bx") ¢x"|dx " Hy po(B’)Ix’> dB", 


(2.174) 
where we use 


Lx" dx"| dx” = 1. 
If H, = V(x), then = 
Cx" | polBY |x") = V(x")polx", x’; BY), 
and Eq. (2.174) becomes 


p(x, x"; B) = Pol; x’, B) 


00 B 
- | | Polx, x"; B — BV (x")po(x", x’; B’) dB’ dx” + >>: 
—-o /0 
(2.175) 
2.11 PROOF THAT F < Fy + <H — Hoo 


Now that we have a perturbation theory, we are in a position to prove as 
promised in Section 2.3, that F< Fy + (H — Hoyo. Let V= H — Ay. 
Since F is determined by the equation e 8? = Tr e~ 8#0t”) the most obvious 


approach is to find an approximation to F by using the perturbation expansion of 


= p. 


B 
e7B(Hot+V) _ p~BHo -| eo (B-u)Hoy p—4Ho dy 
0 


B (us 
+ | \ du, du, e” (B-u1)Hoy 9~ (41 — U2) Ho 9~ 42H Sirota: (2.176) 
0 O 
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Taking the trace of Eq. (2.176) we obtain 


B 
e7 FF = Tr e  AtHotV) — g-BFo _ | Tr [e7 FM Foe “#o7] dy 
0 


B fu 
i | | 1 ‘ii du, Tr [e7 A ~ a HoYg- (u1— 42) Ho e— u2Ho] a aide 
Oo yO 


(2.177) 
Using the fact that Tr 4B = Tr BA, we find that 
B 
e~ FF — e  BFo _ | du Tr e~ SHoy 
0) 
B fu 
+ | | du, du, Tr [e FB °e1—u2)Hoye —(1—ua)Hoy] —.--. (2.178) 
0 J0 


It is possible to simplify the second-order term. Let w = u, — u, and let 
x = u,. Then the integral becomes 


| Tr [e Phew toVe wHoy 7, (2.179) 
A 


where A is the shaded region in Fig. 2.4. Now let w’ = B — wand let x’ = B — x. 
Then the integral becomes 


| Tr [e PH oevHoVe-wHoy 7. (2.180) 
Pe 


Averaging Eqs. (2.179) and (2.180), we find, from Eq. (2.178), 


B 
e BF — e PFo _ BTr [e PPoy ]. 4 | dw Tr [e~F#oewhoVe-whoy] 4 +--+, 
0 


(2.181) 


Fig. 2.4 The shaded area A is the region over which Eq. (2.179) is integrated. 
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Let |m) and |n) be eigenstates of Hp, and let E,, and E, be their respective 
eigenvalues. Then 


Tr [e FV] = ¥ <nle“F°V [n> = Ye FV, (2.182) 


Tr [e7 FBogwHoyewHoy] = ¥ <nle FhoewHoy ¥° |m><mle*#eV [n> 


= ye ee On Vos (2.183) 
where we have used the fact that >, |m><m| = 1. Equation (2.181) becomes 
B e bEm = e BEn 
e~ FF — eg FFo — YS" Be PV, + - > i [Vinnl? oct. (2.184) 


Note that if m = n, 
e@ bEm _ e@ BEn - 7 BEn eh (En— Em) _— 1 = Be7 FEn, 
E,, ¥, En E,, ~~ Em 
In order to get an expansion for F, we write F = Fy + F, + F, +°°:°, where 
F, is a first-order perturbation and F,, is a second-order perturbation. Then 


e FF — e~BFog- BUFitFat--) 
B? 
= e fF] — BF, — BF, + > Fi + Fp +-:-P +-:-) 


e7 bFo [ — BF, + G Fi - pr;) + eel (2.185) 


up to and including the second order. We have a perturbation expansion for F 
if we set 
—e PPORF, oe > Be PPny,, 


and (2.186) 
2 —BEm —BEn 
o7 PFo BY p2 ae) ee Bye tm — et V1. 
(5 1 B 2 4 » E, _ E,, | Wail 
Solving these equations we obtain 
Tr Ve™ Ho 
F, = ——— = (Vy, 2.187 
1 Tr o~ BHo < > H ( ) 


and 


py = B[De Mae OP _ A are yp ee yn 
ae 2 > e BEn fe ee eos ee = 


B = ae 


gets (2.188) 


Z 
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With the Cauchy-Schwarz inequality, 


Lab) <|¥ lal] Sb? |, 


it is easy to show that for any set of positive real numbers, w,, 


pas Wran : < a Wal al? 
Dae Wh 7 Din Wr 
Applying this inequality to Eq. (2.188), we have 


F.< 1 pro x ee iy 2 <0 (2.189) 
oe m E,~Eq_ 


Nowy, let 
H(a) = Ay + aV, 


and let F(a) be the free energy calculated from H(«). Then F(O) = Fy and 
F(1) = F. From the preceding perturbation analysis, 


F(a) = Fy + aF, + a7?F, + 0(a°). 
So F’(0) = F,. If we can prove that 
F’(a) < 0 for all a, (2.190) 


then F(a) will be concave downward (see Fig. 2.5) and thus it lies below the line 
F(O) + «F’(0) = Fy + aF,. From this we can conclude that 


F< Fyot Fy = Fo + (Vu, 
Or 
F < Fo + CH ae Ho) a» (2.191) 


which was to be proved. 


Fig. 2.5 The situation when F(«) is concave downward. 
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To prove Eq. (2.190) we apply the perturbation analysis to F(a) for each a; 


that is, we write 


Fa + y) = Fo(a) + yFi(@) + 9°F2(a) + O(y*), 


where F,(«) = F(a), and yF,(«) and y*F,(a) are as in Eqs. (2.187) through 
(2.189), with H, replaced by H(«) and H by H(« + y) = H(a) + yV. The 


result of Eq. (2.189) is then 
F. 2 (a) Ss 0, 
so that 
dz 
Fa) = —, F@ + y) = 2F,(a) < 0. 
dy y=0 
This proves Eq. (2.190) and thus completes the proof of Eq. (2.191). 
Applying this same theorem to Fy, we obtain 
Fo < F+<(—V)y. 
Combining Eqs. (2.191) and (2.192), we see that 
Fo + Voy S FS Fo + (Vu, 
A more accurate approximation F is obtained by including F,: 
B 1 
2 


B 
Fr Fo + (Vo + B <V>e—- 54 eWHoy e— wHoy iw) , 
0 0 


(2.192) 


(2.193) 


(2.194) 


CHAPTER 3 


PATH INTEGRALS 


3.1 PATH INTEGRAL FORMULATION OF THE DENSITY MATRIX 


Earlier it has been shown that the density matrix satisfies the equation 


OO ex ay, (3.1) 
Ou 
and has the formal solution 
plu) = e Bulh, (3.2) 


where it will be noted that u has been redefined to be BA, which has the dimen- 
sions of time. Furthermore, the “time” u can be broken up into divisions of 
length e(ne = u) so that p can be developed incrementally as follows: 


o(u) = e7 Helig—Helh . . , 9—He/h 


P-P.’** p(n factors). (3.3)* 


In the coordinate representation for the density matrix p(x, x’; u) the solution is 
represented as 


p(x, xs u) = \ + | pes Xi CO Naas Xn-2> é) = pP(x4; es e)dx, a ae AX, — 1. 
(3.4) 


The representation in Fig. 3.1 is useful in the interpretation of the above expres- 
sion. The particle travels from x’ to x through a series of intermediate steps, 
X1,%X2..+,X_—1, Which define a “path”. The total amplitude p(x, x’, u) for the 
particle to begin at x’ and end up at x is given by a sum over all possible paths, 
that is, for all possible values of the intermediate positions x;. As the time in- 
crement ¢ is allowed to approach zero, the number of integrations on the inter- 
mediate variables become infinite and Eq. (3.4) can be written symbolically 


este {| ®[x(u)] Dx(u), (3.5) 


* The reason we want to reduce ¢ as much as possible is, of course, that the asymptotic 
behavior of p at high temperatures can be easily found (see below). 
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Fig. 3.1 A particle travelling from x’ to x along a path x,, x, etc. 


where 
®[ x(u)] = a p(x, Xn-1> E)P(Xn—1 Xn-2> é) as p(X; Xs é), (3.6) 


and 


Qx(u) = lim dx, dx,:--+dx,_;. 


n> oo 


Although all possible paths are to be considered in Eq. (3.5), the main contribu” 
tion comes from the limited set of paths for which none of the p(x, x’; €) is too 
small. For example, we know that for low eg, 


m 4 — >! 
p(x, x’; &) ~d i e (m/2he)(x x ae 
2rhe 


which is the density matrix for a free particle. The term p(x, x’; e) is small if 


x-x>¥V 2h/m/ é; so the main contribution comes from reasonably smooth 
paths. Now let us find ® more explicitly, first for a free particle and then for one 
moving subject to a constraint. 


Free Particle: 


The Hamiltonian is 
H —f i Par < (3.7) 


From previous results we know 


! m — (m/2fie)(x— x’)? 
p(x, x"5 8) = Je eas Ay (3.8) 
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and 


2 
p(x, x’; U) = lim | -:- exp _ me} (xX ~ Xn-1 
20 Dh g 


: ax, AX,  AXy_4 . | (3.9) 
/ 2nhe/m a/ 2nhe/m . 2nhe/m 


Thus, for a free particle, 


®[x(u)] = lim exp = (S=) Se ta (* - =) I. (3.10) 
e>0 € & 


As the interval ¢ is made smaller, 


Xp *ko1 -. dx(u) 


= x(u) 


Fe du 


and Eq. (3.10) becomes 


1 id mre. 2 
O[x(u)| = exp{—— | —[x(u)]* dus. (3.11) 
hj}, 2 
Particle Moving in One Dimension in a Potential V(x): 
h? oO 
= —-——-~+V(x 
2m 0x? OD 
The equation for p is explicitly 
Op h? 0? 
—-h— = —-—-— pt+V(Qvj)p. 3.12 
Au See (x)p (3.12) 


In an infinitesimal “‘time’’ interval ¢ the particle may be considered as moving 
freely with a small correction due to the presence of the potential V(x). To a very 
good approximation 


dp = 9) EP free: (3.13) 


This result is demonstrated as follows: From perturbation theory we have 


ee | dx’ | pols, x"se— wVGepole’, x's), (3.14) 
— 0 0 
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where in our case, Po = Pree. For low e, because p,,,, is a very localized 
Gaussian, most of the contribution in the integral over x” occurs near x" = Xo, 
where it can be shown that 


ux + (e — u)x’ 
Xo — e 
é 


So we can, for small e, write Eq. (3.14) as 


: 


Cre a MH V(x6) \- dx” pol x, x"s2 — wpolx", x3 #) 


-| a V (Xo) Prree(X; xs é). (3.15) 
O 


Now, if x = x’, X9 is also close to x and V(xq) is constant over the range of 
integration: Equation (3.15) goes over to Eq. (3.13): 


6p(x, x’; &) & = V(X) Pfree(X, X'S &) 


under the assumption that |x — x’| and « are small. 
Of course, as |x — x’| increases, our conclusion that {% V(x9) du = eV(x) 
becomes less accurate, and it might be thought preferable to write something like 


: — 4+. ad oe RP (* + *) 
2 2 


in place of eV(x). But for our purposes, Eq. (3.13) is adequate. So 


iy 402, 


FOr eC [ ‘ 


| % Prec(ele Vor (3.16) 
Thus 


Mop f mx = x)? _ eV) 
p(x, x’; x | | is P | (3.17) 


To find p(x, x’; u) it is only necessary to apply the correction e~”*/* to each 
element of the “‘path’’ of the free particle from x to x’. Thus, as e > 0, 


p(x, x’; U) = [Je i“ \, ew + vew)| iu Dx(u) (3.18) 


= {| ®[x(u)] Dx(u), 
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where x(0) = x’ and x(U) = x. Thus 


O[x(u)] = exp Ij | ‘ aa + V(2tw) | aut (3.18a) 


The free energy is derived from 


eW FF — | oc x) dx 


= | \| exp = \, ew + Vatu) aul oxo} dx 
h 8) 2 x(0)=x(U)=x 


(3.19) 
where U = Bh. 


Having derived the general expression for ® the question now is, which 
“paths” are the most important? If the potential is not very dependent upon 
position, then the more direct paths contribute the most. This is because longer 
paths, with higher mean square “‘velocities,” [x(u)]*, make the exponent of the 
weighting functional ®[x(u)] more negative. On the other hand, suppose the 
potential does depend on position. For simplicity, let the potential increase for 
larger values of x. Then a direct path such as | in Fig. 3.2 does not necessarily 
give the largest contribution, because the integrated value of the potential is 
higher than over another path such as 2 or 3. Consider a path such as 2 that 
deviates widely from the direct path. Then V(x) decreases but xX increases over 
the path. In this case we would expect that the increased velocity x would more 
than compensate for the decreased potential over the path. Thus, the most im- 
portant path would be one for which any smaller integrated value of potential 
energy is more than compensated for by an increase in kinetic energy (sym- 
bolically represented by 3 in Fig. 3.2). 


Fig. 3.2. Three paths between x’ and x. 
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In the classical limit, the temperature is high and U is small. We want the 
classical partition function, and we would like to see how it is related to the 
quantum-mechanical partition function. From Eq. (3.18) 


p(x, x; U) = {| exp 7 | : kes + re) aul Dx(u). 


x(0)=x(U) =x (3.20) 


If we assume that the most important paths are those for which V(x(u)) = V(x), 
Eq. (3.19) can be approximated by 


p(x, x;U) & exp i v0] {| exp al | ° aad Qx(u) 


x(0)=x(U)=x 


exp a v0] Prree(X, xX; U) 


Mm —m(x — x’? — UV(x) 
J 2nhU hU h | — 


Equation (3.21) is just a specialization of Eq. (3.17). But from the derivation of 
Eq. (3.21) it is easy to see the direction of the error in the classical limit. We 
assumed that the most important paths had x(u) ~ x, whereas in fact the most 
important paths are those for which J§ [mx(u)?/2 + V(x(u))] du is a minimum, 
which occurs when (x(u)) goes over paths that tend to let V(x(u)) be lower than 
V(x). It follows, then, that 


Prtassical(% xX, U) < p(x, xX; U). (3.22) 


The classical partition function is 


enPFs = | exe UV) de S| gH UPN gy, 
[ei ) ae — x (3.23) 


From Eq. (3.22) we obtain 
fa & £. (3.24) 


To find the condition for Eq. (3.21) to be a good approximation to the path 
integral, note that the paths for which 


U 29 
7 ES di > I 
hj, 2 


do not contribute much to the path integral. Using the Cauchy-Schwarz in- 


equality, we find 
b 2 b b 
| 2 < | al al? 
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With f = x and g = 1, we have for 0 < u, < U: 


| " xu) < | " 0 < ( | i eu (3.25) 
0) 0 0 


So for the important paths, (m/2hU)|x(u,) — x(0)|? < 1. If dis the maximum 
value of |x(u,) — x(0)| for 0 < u, < U, then the paths that contribute signifi- 
cantly satisfy 


Ix(uy) — x(0)|* = 


mkT d? 
< 
2h? 


For the hydrogen atom at 1°K Eq. (3.26) yields d < 9A. For oxygen at 
1°K, d < 2.5 A; for oxygen at 100°K,d < 0.25A. © 

If the potential varies so slowly that it is approximately unchanged even 
when the path varies by more than d, then Eq. (3.21) is good. Thus, for heavy 
particles at normal temperatures, systems may be treated by classical statistical 
mechanics.. However, the classical approximation does not apply to liquid 
helium, nor does it apply to electrons in a metal, nor even to solids if the tem- 
perature is too low. 

A few things should be noticed about this formulation of statistical 

mechanics: 


(3.26) 


1. We have discussed only the case of one coordinate. However, the extension 
to N degrees of freedom is obvious; if x, x, ..., x“ are the coordinates, 
Qx(u) becomes 

Qx(u) = lim dx{? dx’? +++ dx dx$P +++ dxQ +++ dx. (3.27) 
no 

2. If ‘‘u’’ is replaced by “‘iu’’ in Eq. (3.1), we have Schrédinger’s equation. Ina 
similar manner to that of statistical mechanics, quantum mechanics can be 
formulated in terms of path integrals. For mathematicians, statistical mechanics 
is easier to deal with because the exponentials in the path integrals are exponen- 
tials of real quantities. 


3. We have not dealt with relativity (in which particles appear and disappear) 
and spin (which is a discrete, rather than a continuous, variable) in the path- 
integral formulation. If such effects exist, we can only get approximations. 
Actually, spin can be put into the formalism, but our interpretation of Dx(u) as 
a set of paths differing by an infinitesimal amount no longer holds. 


3.2 CALCULATION OF PATH INTEGRALS 


We will discuss two techniques for the calculation of path integrals: 
1. Integrals whose exponents are quadratic in x and x will be calculated exactly. 
2. Perturbation expansions. 
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Consider the element 
U ‘ 2 
Sf (X2, x1, U) = {| exp ea mle au| Qx(u) (3.28) 
0) 


where x(0) = x, and x(U) = x». This is the density matrix element for a free 
particle. Let us expand each path about the straight-line path in the xu-plane. 
Call the straight line path x(u). Then 


x(u) = x(u) + yu) (3.29) 


and 
1 m 7 : ; 2 
Flea, %1,U) = |} exp} | (eu) + HW) du | Dy). G.30) 
10) 
Now 
dx(u) %x,—- xX, , 
ls = -*£_— = vp = velocit (3.31) 
du U J 
so that 


\, [x + yp]? du 


0 0 


= {. (v? + 2wy + j*) du = v?U + 2v[ WU) — y(0)] + \. yp? du. 


(3.32) 
But (U) = y(0) = 0. Therefore 


2 U 
f(x, x,, U) = exp ew {| exp 5; \ y? du Dy. (3.33) 
18) 


Thus we have split the path integral into a factor dependent upon the endpoints 
and U, and a path integral dependent only upon U. Writing 


exp| —_ 52 du| Dy = F(U) 
2h Jo 
y(0)=y(U)=0 
we get 


p(x2,x,, U) = F(U) exp [—m(x, — x,)*/2hU]. (3.34) 


We can obtain an equation for F(U) by using 


p(x, Y;u, + U2) = | Ax’ p(x, x’; U2)p(x’, y; uy). (3.35) 
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The reader should derive that equation for F(u) and show that its most 
general continuous solution is 


F(U) = Vm/2nhU e%, (3.36) 
where « is an arbitrary number. Physically, the value of « is immaterial, because 
a change in the zero of the potential does not change the physics; it does, 


however, change the value of «. As a matter of fact, since the expectation value 


of any quantity, A, is given by 
Tr pA 
(Ay Ee 


3.37 
Trp (3.37) 


F(U) need not be found at all. It cancels out in the division. We know that a is 
zero, however, from other methods of calculating p (see Eq. (3.8)). 
Now consider the path integral for the harmonic oscillator: 


1 (°Tmx? = mo? 
x1,U) = 2 | [My me 62 | dul ox. 
I (Xo, X1, U) {|} 48s ; | ut x 


Let us expand each path about the path that makes the largest contribution 
to P. For this path, 


U U “2 2 
| G(u) du = | ks eee “| du = minimum, 
0 o 2 2 
where x(0) = x,, x(U) = x. Thus, our integrand must satisfy the Euler 
equation, 
MOOG ay op ae ne: (3.38) 
Ou Ox Ox 


x = Ae™™ + Be™%. 
For any path, we write 
x=xt+y 


so y(0) = 0, y(U) = 0 and the integral over u becomes 
U 2 
| 5 (x+y? + 7 Ww 4 oa du 
en ee 2 
U 
— | 5 Le + 2p + p? + wx? + w?2xy + wy? du 
O 


Um : m Me ag 2 
= | = (x? + wx?) du + a (2x + 2w*xy) du 
: | 


0 


m sd 2 2.2 
+ — (y* + wy”) du. 
2 Jo 
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Now 


U U 
| 2xy du = 2yx 


0 


Because y(0) = 0 and y(U) = 0, we have 


U U 
| Axy + w*xy) du = | 2y(—x + wx) du = 0, 
0 0) 

since x satisfies the Euler equation. Thus, by expanding about the classical path, 
we have eliminated the integrand linear in the deviation. Thus we may write 


U 2 
f(X2, X,; U) = exp © | & x7 + — *) iu] F(U), 
O 


F(U) = {| exp ee {, (5 jy? + me »*) iu Dy(u), 


with y(0) = Oand y(U) = 0, is a factor independent of the endpoints x, and x5. 
Thus expansion about the classical path allows us to separate the path integral 
into a factor dependent upon U and the endpoints, and another path integral 
dependent upon U alone. Thus the important dependence on x, and x, can be 
found by simply solving the minimizing differential equations (Eq. 3.38) subject 
to the end conditions x(0) = x, and x(U) = x, and calculating the integral 


U 2 
mM gz 4 MO ye?) du. 
me 2 


In fact even this integral can be simplified. We have 


where 


U U U 
| 7 du = 7 xi —t xx du. 
0 0 0) 
Therefore 
U U U U 
™ (x? + wx) du = ™ xt + ll (—x + w*x) du = m” xx| * 
0 2 2 0) (0) 2 2 0) 


As a new problem, suppose we have two interacting systems, one of which is 
a harmonic oscillator. Let x describe the position of the oscillator and g that of 
the other system. A possible Hamiltonian is 
mx mo* , . M@q? 


* You can compare your result with Eq. (2.83). 
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The partition function is 


Q = {| exp & \, es + mex _ ya(wx(0) au 
x exp os {, 3 + v(a)) iu Dx Dq. 


We cannot, in general, evaluate this path integral, but we can integrate out 
the x variable. To do so we must find 


F[q(u)] = {| exp -; |, ea + mee _ 10x) du Dx 


This integral can be evaluated by the same method as that used for a har- 
monic oscillator. More generally, we can evaluate 


F Lf; x, x'] = {| exp E {, (" + mer + TOO) au Dx. 


x(0)=x 
x(U)=x’ 
The result is | 
FU; x, x’) = Vmo/2nh sinh aU e7%, (3.39) 
where 
e7olu-u'| Vid a 
o-7 |, ix LO eee 2 sinh @U 
x [(x? + x’?) cosh aU — 2xx’ + 2A(xe?” — x’) + 2B(x'e?” — x) 
+ (A? + Be — 2AB], (3.40) 
where 
A= [- c-op(u) du, 
2mo@ 
. U 
es [ e- 9O- Fy) du. (3.41) 
2mo@ Jo 
Now let 
° U 
e[f] = (exp a | fw)x(u) iu) 
harm. osc. 
_ SFG: x, x) dx (3.42) 


—  F,(0; x, x) dx 
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Eventually one will arrive at the result: 


GLf] = exp Ee {, {, cosh tle — eT 3Y) pen ftu ae au'|. 


(3.43) 


If we define f outside the interval 0 < u < U by letting f be periodic with 
period U, we may write 


Sif) = ep ee {- {. oo IF(u) fu) du iu’| (3.43a) 


An alternative way of finding F,(/; x, x) is to Fourier analyze f(u) and x(u): 


x(u) = ay + Le (a Cos a + b, sin 7) 


fu=fo+ ¥ (1 cog oan a"). 
= B B 
The path integral 


F,Lf] = | F,(fs x» 8) de 


can then be written as 


F.[f] = iP | da da,:+:db, db,:°: 
«ej (SE + Mw + veont) | 


and can easily be evaluated.* 


* The equation 


1 — cosnO _ ™ cosh (x — |A\)a 


2a” n=1Hn* +a? 2a sinh za 


is useful. F; [f ] as found by the above method has an unknown multiplicative constant 
as a consequence of the Jacobian for the change of variables implicit in Dx(u) > 
da, -::db, db.--- ; but Eq. (3.43) can be found unambiguously by the above method. 
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For a free particle, we can find F,[f] either by letting m > 0 or by direct 
calculation. The result is 


{| exp be {, [4mx(u)? + if(u)x(u)] au Dx(u) 
) 


'| en |. f(u) du exp ve ( \. |. lu — u'|f(u)f(u') du du’ 
9) U 2 
+ ? | uf (u) au ) . (3.44) 


3.3 PATH INTEGRALS BY PERTURBATION EXPANSION 


We will make use of this last result later. 


Suppose that we have found the density matrix element p,.(x., x,, U) for some 
system of potential V,, and we wish to know p(x,, x,, U) for another system with 
potential V> + V’, where V’ « Vo. Then 


p(x2,X,,U) = {| exp I; {, ke + V(x) + x) au 2x| 
0 
U “2 
- {| jex 1-5 \ kes + Yo iu} 
1 U 
x exp I; | V(x) aut | 2D, 
Expanding the second exponential, we have 
= {| exp I; \, ks + ro aul 
x -- An V'[x(u)] du + att, V'[x(u)] du) + + [9s 


= Po — ; {| : exp = | ea + ve iw'| 


x exp = i; es + ro | iw'| V'[x(u)] du Dx +°°: 


Le 1 (*[ mx? ; 
= Po — ‘|, du {|e eit i + ro | iw'| 


x V'[x(u)] exp = a kes + vo iw'| +... 
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xy 


Fig. 3.3 Integrand of first-order term 


In order to calculate the path integral, we first integrate over all paths for which 
x(u) = y (with x(0) = x,, x(U) = x,) and then integrate over y. 


1 U 
p(X2,x,U) = po — i | | Po(x2, y, U — u)V(y) poly, x1, u) dy du +->: 
0 ally 


Pictorially, the integrand may be represented as shown in Fig. 3.3. 

We can think of the zeroth-order term, p,, as being the contribution to the 
density matrix that results from the particle going from x, to x, in “time” U 
without being affected by the potential. Then the first-order term, which has its 
integrand represented in the Fig. 3.3, may be thought of as the contribution 
resulting from the particle being essentially free, but being “‘scattered”’ once at 
some “‘time’’ uw and some position y. Similarly, the second-order term can be 
considered as the contribution from two “scatterings”, and can be written as 


1 U u 00 oe) 
+— | du | dv | dy | dz 
h 0 0 — 0 =S100 
x Po(X2; Z, U= u)V (z)po(z, y,u — v)V(y)poly, X15 v) 
Pictorially, this integrand may be represented as shown in Fig. 3.4. 


| Notice that we have not found any new results. We have already derived 
this perturbation expansion without the use of path integrals. 


Problem: Find to what order the nth-order integral expansion of p(x, x,, U) 
solves 


dp h? 67p 
—-— = -— —, + [V, + V' : 
a = 5 aa + [Yo) + V'@De 
knowing that 
0 h? a? 
Po = = FO + V4(x)p0. 


 @u——S*«m OX? 
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Fig. 3.4 Integrand of second-order term. 


3.4 VARIATIONAL PRINCIPLE FOR THE PATH INTEGRAL 
For the free energy we know the relation 


F< F, + (H - Hoyo (3.45a) 
or 


F < (H), — TS. (3.45b) 


This is written in terms of the Hamiltonians H and H,. We ask ourselves if we 
can write a corresponding minimum relation using the path integral. 
In the path-integral form, the partition function is written as 


e FF — e PMID (yu), (3.46) 
x(0)=x(U) 


Suppose we have another “S’’, So, that is easier to work with. (Here Sy does not 
mean entropy). Then Eq. (3.46) may be written as 


—(S—So),—S 
—BF __ {J e ( oe 2 Dx eo BFo 
= 9 


{J e~°? Dx 


(3.47) 


where 
e PFo = {| e~ °° Dx. (3.48) 
The first factor of Eq. (3.47) has the form of an average of exp [—(S — So)] 
with e~*° as the weight for a certain path x(u). We may write Eq. (3.47) as 
e~FF = (e-S~So)y, eg BF o, (3.49) 
Now we suppose that S and Sp are real and use the inequality 


Cent ye eS, (3.50) 
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Fig. 3.5 Geometrical interpretation of (e~7) = e7. 


The geometrical interpretation of this relation is as shown in Fig. 3.5. In Fig. 3.1, 
<e~F is always above the curve e~/. Note that Eq. (3.50) does not depend on 
how the /’s are distributed. When we apply Eq. (3.50) to Eq. (3.49) we can write 


e~BF > @- S-S0) g-AFo. (3.51) 
wnere 
[J S = Soje™*? Dx 
($ — Spy = UL ni oe. : an 


Usually Eq. (3.52) is simpler to calculate than the first factor in Eq. (3.47). We 
thus have a theorem: 


F< Fy + 4s — Sods, (3.53) 
Exercise: When 
S= |, kw + vextw) du, 
(3.54) 


So = \, ke + ral) du, 
0 


3 - Soo = <V — Varo. (3.55) 


show that 


Thus Eq. (3.53) contains Eq. (3.45) as a special case. (Hint: from Eq. (3.54)), 
p 

$5 | (V — V) du. (3.56) 
0] 


The numerator of the right-hand side of Eq. (3.52) becomes an integral over u, 
the integrand of which is independent of wu. 
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3.5 AN APPLICATION OF THE VARIATION THEOREM 


Finally in this chapter let us explore an application of our variation theorem, 
Eq. (3.53). 
In a one-dimensional, one-particle case, we have 


ee \ : a + v(x) du, (3.57) 
h\,| 2 
where 
U = Bh. (3.58) 


To avoid unnecessary clutter, we will set 4 equal to unity in most of the following 
work. Every once in a while we will restore the f’s, so that the reader can see 
where they belong. 

The partition function may be written as 


U ° 2 
e7 FF exp -| en + v (x(a) aul Qx(u) dx(0). (3.59) 
0 

Here, we first fix x(0) and x(U) = x(0), and then do the path integral over all 

paths; then we vary x(0). The zero in $ indicates that we integrate over x(0). 
In the classical limit of high temperature, or small 4, U is small, so that the 

path does not deviate much from the initial point x(0). Thus the first approxima- 

tion in Eq. (3.59) is to replace V(x(u)) by the initial value V(x(0)) to obtain 


Pal ae i | e-VOVMKT gy. (3.60) 
7 


This is the well-known classical result. 
How can we improve on this approximation to take the quantum effect into 
account? 
First we observe that, because x(0) and x(U) are equal, it appears more 
natural to use an average ; 
x= : | x(u) du (3.61) 
U Jo 
in place of x(0). Second, the deviation of the path from the classical straight line 
may be taken into account by some average of V(x) over the path rather than by 
a constant V(x(0)). These considerations lead to the following trial So: 


U ° 2 
So = | lis) REO ot (3.62) 
9 2 
Here w(x) is a still undetermined function, which is to be varied later to minimize 
Eq. (3.53). In the first approximation one would expect to choose w(x) = V(x) 
but we can do much better with our variational principle. 
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e~ FFo — {| Qx exp -|. kes + Uw) du 


all closed paths 


= [ax {| exp ea ie du + Uw | iu] Dx. 


all closed paths x 
with fixed average x 


Now let y = x — x(0). Since p(u) = x(u), we obtain 


U 2 
e PFo — [ax e~ Uw») {| exp Bl a Dy. 
0 


all closed paths 
y with (0) = 0 


In the path integral above, there is no restriction on y because y = x — x(0), 
and we integrate over x(0). The reasons for integrating over the paths chosen can 
be made clearer by drawing diagrams; the reader should do this if he needs 
clarification. 


e~FFo — | dx e~ Uw@)y. (0,0; U - m_ | e UW) dy, 3.63 
| Pew(0, 05 U) = Jo | y. (3.63) 


It should be noted here that in the classical case Eq. (3.60), w(y) is replaced by 


V(y). 
Next, we calculate <S — S)>. Using Eqs. (3.57) and (3.62), we see by 
definition 


<5 — So) 
_ {ABS — So) exp [—J mx?/2 du] exp [— Uw(x)] 9x 
{J exp [—J mx?/2 du] exp [— Uw(x)] 9x 


_ Vy Glu Jo V(xtu')) du’ — w(x)} exp [—J mx?/2 du] exp [—Uw(x)] Dx 
.{] exp [—J mx?/2 du] exp [— Uw(x)] Dx 


(3.64) 


Here x is defined in Eq. (3.61) and is functional of x(u). The first term in the 
numerator is simplified as follows: 


Ve {, V(x(u’)) du’ exp & ce mi(uy’ iu exp [—Uw(x)] 2x 
=| 5 an all V(x(u’)) we = mi(uy’ au exp [— Uw(x)] 9x 
at a |e V(x’) {| exp rp mw) du | mw) iu 


ay soe 


x exp [ —Uw(x)] Dx(u). 
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yo0=—— 4525-5555 a 


Fig. 3.6b 


We are integrating over paths like those in Fig. 3.6a. By making the change of 
variables 

ven O<su<u' 

u<u<wU 


u— uU 


we integrate over paths like those in Fig. 3.6b. Because of the condition that 
x(u = 0) = x(u = U), we do not get a discontinuity in x at V= U—w'; 


therefore 
U 32 U 32/7 
| mw) Fie | me) dv. 


0 0 


Also, 
x XU u x(V V. 
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The first term in the numerator of Eq. (3.64) becomes 


{, du dx'V(x') {| exp [-[ Pav | exp [ — Uw(x)] 
0) U . : 


x(0)=x(U) =x’ 


V(x(0)) exp |. me) au exp [— Uw(x)] Qx(u). (3.65) 
x(0)=x(U) , 
Thus Eq. (3.64) may be written as 
: (S ~ So>sy = <V(x()> — <w(x)). (3.66) 


To transform further we use the following result (proved previously): 


{| exp | - * mu)” du | ex | \, F(u)x(u) du Dx(u) 
ps {| flu) iu) oP ban | {, CDFG did 
+ ath uf (u) iu) |} (3.67) 


In evaluating Eq. (3.65) we first do the path integral keeping x fixed; then 
Eq. (3.65) is written 


{| V(x(0)) exp et mau du exp [ — Uw(x)] Dx 


| K(y) exp [—Uw(y)] dy (3.68) 


where 

K(y) = {| V (x(0)) exp | - \, miu) iu Dx (3.69) 
such that x = y. Equation (3.69) may be written 

K(y) = {| V (x(0)) exp es \, muy iu 5(x — y) Bx(u). (3.70) 


For this we use the Fourier transform: 


V(x) = | ecae dq (3.71) 
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to write Eq. (3.70) as 


K(y) = | CCE | dk {| exp [iqx(0)] exp [ik(x — y)] 


U p02 
x exp |-| a du Dx. (3.72) 
1) 


Here 0(x — y) of Eq. (3.70) was also Fourier transformed, and we are ignoring 
factors such as 2x. Use Eq. (3.61) in Eq. (3.72) to write 


K(y) = [oa dq | ak exp [ —iky] {| exp [iqx(0)] exp : e \, x(u) au 


U °2 
x exp |-| aa au Qx. (3.73) 
Q 


Now we see that the path-integral part of this expression can be brought into the 
form of Eq. (3.67) if we define 


flu) = qo(u — 0) + (3.74) 
because 
U k U | 
| f(u)x(u) du = qx(O) + 7 \ x(u) du. (3.75) 
0 0 
When we use Eq. (3.74) in Eq. (3.67) we see that 
[7 du=qtk, (3.76) 


\, wane (3.77) 
; 2 
\, \, lu — u|f(wf(u’) du du’ = . “i (u — u’)f(u') du’f(u) 
x — u’) - du’ + “| f(u) 
U | 


2 iy f= rt va |S) = Ke Sau: 


(3.78) 
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Use Eqs. (3.76), (3.77), and (3.78) in Eq. (3.67) and also in the path-integral part 
of Eq. (3.73) to write the latter as 


U fk k2 
K(y) = | dqv(q) | jcc (ORG ees 2 (‘ er a) 


3 2 
: k?U 
= { daea) | ae exp | —iky]6(q + k) exp Sam 
24m 
= | ate k) exp [—iky] exp eo . (3.79) 


Using the inverse Fourier transform, we find that Eq. (3.79) becomes 


K(y) = {at exp [—iky] [ve exp [ikz] dz exp | - | 
24m 
= [eve exp Gs (y mal (3.80) 


When the correct factors are taken into account, the final form is 


Ko = | dzV(@) exp| - seen eed | (3.81) 


It should be noticed that K(y) is V(z) averaged over a Gaussian. The root mean 
square of the Gaussian spread is 


h 


ro (3.82) 


This spread is about 1 A for a helium atom at 2°K, and is a very narrow spread 
at room temperatures. At the limit of infinite temperature, the Gaussian becomes 
a delta function, and K(y) > V(y). 

Summarizing the results obtained so far, we have 


1 
F<sFo + 3° — So) sos (3.83a) 


| exp [—Bw(y)] dy J ae (3.83b) 


_ J (KO) = w(y))e7?" d 
76S — Sods a 


e bFo 


(3.83c) 
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Equation (3.83b) is Eq. (3.63). Equation (3.83c) follows from Eqs. (3.64), (3.65), 
and (3.68). Further, K(y) in Eq. (3.83c) is given in Eq. (3.81). 


Now we ask what is the best choice of w(y)? Changing 


w(y) > w(y) + n(y) (3.84) 
where n(y) is small, we see from Eqs. (3.83b) and (c) 


_ Sne~*" dy | 
5Fy = TePay (3.85a) 
1 _ fe ’*[-—Bn(K — w) — ndy] , [e-*"(K — w) dy | Bne~*” dy 
6S = 5 2 ei ee 
Po? e~ 4 dy . ({ e~ 8" dy)? | 
(3.85b) 
Thus 
5 (Fe + 5s - S0)s.) =, (3.86) 
leads to 
w(y) = K(y). (3.87) 


This is the best choice of w(y). 
In this case Eq. (3.83c) yields 


| <S — So)s, = 9, (3.88) 
so that, from Eq. (3.83a), 
F< Fux (3.89) 


where F., , means the classical free energy with the potential V(y) replaced by 
K(y). Alternatively, 


7 bFe1 x - | mM | e-BKO) g 3.90 
2nhU ae 


compare Eq. (3.60). K(y) is defined in Eq. (3.81). Fy x is a better approxima- 
tion than the ordinary classical form F,, in Eq. (3.60). 


Example: To find out how good Fy x is, let us consider the harmonic oscillator. 


In this case the potential is 
pe A 


Vix) = me. (3.91) 


i) 


From Eq. (3.81) 


2 2 2 
K(y) = fae z* exp Pai § (y — z)*|dz= tae y> + es 
TC 12m 


(3.92) 
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Using this result in Eq. (3.90), we obtain 


—BFclK _ m = mo” 2, WB d 
: Jip [| Pe (» 12m) | “” 


= (1/ABo)e*P?07/24, (3.93) 


Thus, we have the following results: 


| ee : In 2 sinh ex : (3.94a) 
B 2 
_ 1 fy2 B* 
| R Ec (Awp) + 7 | (3.94b) 
Pe m (Fico). (3.94c) 
When we write 
G= ae and f= - (3.95) 
Eq. (3.94) becomes 
Give = : In (2 sinh f), (3.95a) 
2 
Gan = ION +, (3,95b) 
Ge : In (2f). (3.95c) 


Table 3.1 is a numerical comparison of G,,,., Gy x and Gy. This table shows 
the remarkable improvement Gc, x has over G,. Remember that f = 4 cor- 
responds to the temperature kT = hw where the quantum effect is large. 


Table 3.1 

f= 1/2 f= f= 
Gerue 0.08263 0.8544 0.9908 
Gak 0.08333 0.8598 1.0264 


Go 0 0.6931 0.6931 
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The expression F,, , in Eq. (3.90) combined with Eq. (3.81) is better than the 
series expansion of K(y) in terms of V(y), d*V/dy’,... 

Nevertheless, F., x is not as useful as Table 3.1 might indicate. First, it 
cannot be used when quantum-mechanical exchange effects exist. Second, it 
fails in its present form when the potential V(y) has a very large derivative as in 
the case of hard-sphere interatomic potential. 


CHAPTER 4 


CLASSICAL SYSTEM OF WN PARTICLES 


4.1 INTRODUCTION 


The classical partition function of N interacting particles is written as 


1 m 3N/2 
a Zn, (4.1) 
: N! (5-575) x 
where 
Zy = pemmne TEN AR: d°R, oe aR (4.2) 


These equations can be derived from the quantum-mechanical results we have 
obtained previously. From Chapter 2, Section 8, we have 


e7 BFs = Nie | Pov Xo PX,,..., PXy) dX,:::dXxn 
ce sg 
for Bose statistics. For Fermi-Dirac statistics, 
e7 Pa — (= Do Xis eo XN; | 5, Seer PXy) dX,:°:dXy. 
: P 


In either case, only the identity permutation is important, because at 
sufficiently high temperatures, factors such as e~™*7/2#*(X—-PX)” kill the other 
terms. It follows that: 


en FF ny a o(Xrs--+> Xni Xtyee+> Xy) dX, +++ dXy. 
We have already estimated this integral for the case N = 1 by the use of 
path integrals (see Chapter 3, Eq. (3.60)). Generalizing the method used to the 
case of N particles in three dimensions, we arrive at Eqs. (4.1) and (4.2). 
When a gas particle is a polyatomic molecule, the internal motion of a 
molecule and the motion of the center of gravity can be separated; then the 
specific heat is a sum of the two contributions. In Eq. (4.2), R; is regarded as 
the center of gravity of the ith particle; the partition function for the internal 
motion is not included here. The latter can be calculated using information 
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about the energy levels obtained from the infrared spectra, or calculated using 
quantum mechanics. 

For a dense system such as a liquid the internal motion of polyatomic mole- 
cules and the motion of center of gravity are entangled with each other, and the 
two are hard to separate. 

The following discussion excludes the internal motion of a particle, and the 
system is regarded as a gas or a liquid of inert gas particles. 

The free energy is written from Eq. (4.1) as 


2 
pet ee pe ae (4.3) 
m B Bee 


The e in the last term of Eq. (4.3) is the base of natural logarithms, rather than 
the electronic charge. This e appears because we used the Stirling formula: 


) — 
N! &. J2nN >In N! % N (1 ®): (4.4) 
e e 
From Eq. (4.3), we have for the internal energy U 
2 
uy = —T? MIP) _ arp 4 BE Okn (4.5) 
OT NZ, OT 
where 
R = Nk. (4.6) 


The equation of state is found to be 


PV _ Vay 
RT NZy @V- 


(4.7) 
from Eq. (1.6). 
Example: For an ideal gas, V(R;,..., Ry) = 0; therefore Zy = V™ and 
Eq. (4.7) becomes 
| PV = RT. (4.8) 
When we assume that the potential energy can be written as a sum of pair- 


wise potentials: 
V(RyR2 +++ Ry) = LV (rij), (4.9) 
LJ 


pair 


Eq. (4.2) can be written 


Zy = [ew |-# > veri) | d°R, d°R,--+-d>Ry. (4.10) 


pair 
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V(r) 


no 


Up 


Fig. 4.1. An unrealistic potential; v9 is the volume of a cluster of N particles. 


Note that this expression contains all the information about the phase transitions 
of the system. In other words, the nature of the phase transitions can, in principle, 
be discussed by examining the behavior of the integral in Eq. (4.10) purely 
mathematically, without physical knowledge. 

Equation (4.10) suggests that when the number of particles N is finite there 
is no discontinuity in physical qualities. Only when N — oo do we expect 
discontinuities, and this is the case we are interested in. 

The assumption of a central force potential V(r;;) in Eq. (4.9) is not exactly 
justified. The necessity of a noncentral force is illustrated by the following 
observation: In the solid state, if we assume the nearest-neighbor interaction 
only, the face-centered cubic and the hexagonal close-packed structures are not 
distinguishable. When the second nearest neighbors are taken into account the 
central-force assumption leads to the conclusion that hexagonal close packing 
has the lower energy, whereas face-centered cubic is actually more stable for 
solid argon for example, which is inert in its gaseous phase. 

The free energy F of Eq. (4.3) is not necessarily a function of V/N only. 
If we take an unrealistic potential like the one in Fig. 4.1, the energy of the 
system is roughly of the form CN”, where C is a constant and N is the number of 
particles forming a cluster of volume vy. Thus, we can think of two extreme 
forms of Zy: 

(1) — ,—BN2C,,N 
oe (4.11) 
22) = VN, 
When the temperature is low, for large N, the contribution of ZY can be larger 
than Z{); then F is not a function of V/N only. 
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4.2 THE SECOND VIRIAL COEFFICIENT 


When the gas is not dense, the data of the equation of state are summarized in 
practice as 
PV B C 
Se Se a (4.12) 
RT VOY 
where B is called the second virial coefficient, and C the third virial coefficient. 
In this section we will derive an expression for B. We assume a pairwise potential 
as in Section 4.1. 


Consider a system of N + 1 particles. In this case, Eq. (4.10) of Section 4.1 
becomes 


Deg = [ew E Vx —- Rd | exp |-6 ps ver d°X d°%R. (4.13) 


When we integrate over X first, all the R,’s are fixed. If the gas is not dense, 
the volume for which V(X — R;) is appreciably different from zero is small; so 
we May approximate 


Zya1 &% VZy. (4.14) 


Equation (4.14) will lead to the ideal-gas equation of state. To obtain a better 
estimate of Zy4,, we write Eq. (4.13) as 


a Texp[—-B San Vrid™R- 


x _ |-p >, ver) | d>*R. (4.15) 


pair 


The ratio part of this equation has the form of the weighted average of 
| exp [-B DL V(X — Ry] ax. 


We will evaluate this average by neglecting the three-body and hi gher-order 
collisions. When _X is close to R;, this pair is assumed to be far away from any of 
the rest of the R,. This assumption is equivalent to replacing the weight factor 


exp | 6 y Vr; d| 


by unity (see Fig. 4.2). When X approaches R; in Fig. 4.2(b), we have a three- 
body collision among X, R;, and R;. Therefore, we neglect the configuration (). 
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Weight: 1 Weight: e 
(a) (b) 


-BV(R,—R,) 


Fig. 4.2 Weighting factors in the derivation of the ideal gas equation of state. 


Thus, when we neglect the three-body and higher-order collisions, we can 


approximate Eq. (4.15) as 


~ Jexp[-B¥, V(X — R)] d?X d°*R 
cs) 


_ { d°X [{ exp [—BV(X — R)] d°R]" 7, 


yN 


Now, we make the transformation: 


femomapay [tr ememnyae 
When we write the second term as a, 
a= | [1 — e PY ]4nr? dr. 
1) 


Using Eqs. (4.17) and (4.18), we write Eq. (4.16b) as 


VV — a)% 
Zar =O py 


N 


(4.16a) 


(4.16b) 


(4.17) 


(4.18) 


(4.19) 


This is a recurrence relation for Zy. We can solve for Zy from Eq. (4.19) as 


follows: 


Zn = 


| 
< 
"oN 
ba 
| 
N|& 
nn” 
2 
I 
2 


(4.20) 
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Multiplying all the equations in Eq. (4.20), we have 
a (N-1)+(N—-2)+°--+4+1 a N(N-1)/2 
Zy =V* ( — 4 = VN (: — r) . (4.21) 


When N is very large, it can be approximated as 
Ze Ve eee, (4.22) 


This exponential part is interpreted physically to mean that each pair contributes 
a factor e @/Y), 


The equation of state is derived from Eqs. (4.22), and (4.7): 


2 
PV __VolnZy_V a Ninv — 24 (4.23) 
RT N_ ov N oV 
or 
PV/RT = 1 + Naj2V (4.24) 


Comparing this with Eq. (4.12), we obtain the expression for the second virial 
coefficient: 


B= Na/2. (4.25) 
This is an exact result, in the sense that the three-body and higher-order col- 


lisions that we have neglected have effect only for the third virial coefficient and 
higher-order terms of Eq. (4.12). 


Example: Let us consider a potential as shown in Fig. 4.3. Then Eq. (4.18) is 
evaluated as 


b 9) 
a= | (1 — 0)4nr? dr + | [1 — ef 9 ]4nr? dr. (4.27) 
0 b 
Vir) 
O b ’ 
o(r) 


Fig. 4.3 A hypothetical potential. 
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When the temperature is high we may approximate Eq. (4.27) as 


3 fo) 
pe Oe oar ae (4.28) 
3. KT |, 


which we may write in the abbreviated form: 
a= Up = Tvs (4.29) 


When we use this in Eq. (4.24), 


PV _44N%_ ND (4.30) 
RT 2V=«o2WT 


We can find out if this expression fits experimental data well by comparing 
it with the Van der Waals equation of state, which is a good fit to the data. The 
Van der Waals equation: 


(P + a,/V?\(V — b,) = RT, a 
can be expanded as 
_ 2 
BE eae fy Te cee (4.32) 
RT V V 


If we set NV,/2 = b, and ND/2 = a,/R, we arrive at Eq. (4.30) up to the 
second virial coefficient. If the Van der Waals equation is valid, it gives a second 
virial coefficient of 


B, = b, —a,/RT. (4.33) 


A special feature of this expression is that, because a,, is positive, B,, 
becomes negative for low temperatures. Experimental data for helium are 
drawn in Fig. 4.4. The slight decrease for high temperatures does not follow 


23° K 


Fig. 4.4 Experimental data for helium gas. 
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from Eqs. (4.33) or (4.30). It must be remembered that Eq. (4.30) is based on the 
hard-core assumption of Fig. 4.3. In actuality, the potential V(r) in Eq. (4.18), 
which is accurate, is slightly soft inside r = b, so that for high temperatures, 
when particles are hitting each other hard, they can come slightly coser than b. 
Thus there is a decrease in the “‘hard-core”’ radius, and hence a decrease in B. 


Numerical results: When we use the Slater potential 

V(r) = e %" — c,/r®, (4.34) 
or the Lennard—Jones potential 

V(r) = A’/r™ — B’/r®, (4.35) 


in Eq. (4.18) we can calculate a. Incidentally, in Eqs. (4.34) and (4.35), the r~° 
term comes from the Van der Waals attraction. 
Table 4.1 lists the numerical results for helium gas (in units of cc/mol). 


Table 4.1 

T°K *Beaic (CL.) Boos TBeate (Q.M.) 
350 10.80 11.60 10.82 
250 11.34 11.95 11.16 
100 10.75 10.95 

35 4.80 | 4.44 

20 —6.95 — 4.00 —5.14 

15 — 20.0 — 14.0 —15.1 


* The Slater potential is used. J. G. Kirkwood, Phys. Rev. 37, 832 (1931). 
+ J. G. Kirkwood, Phys. Zs. 33, 39 (1932). 


The quantum-exchange effect on the second virial coefficient was neglected 
in this section. This effect becomes important only for temperatures below 
about 1°K. 

The general references on the subject of the second virial coefficient are: 
J. de Boer, Report on Progress in Physics, Vol. 2, p. 305 (1948-1949). Mayer- 
Mayer, Statistical Mechanics (John Wiley & Sons, Inc. New York, 1940). 


Problem: For a mixture of two gases with atomic fractions X, and X,, such that 
Xx + X> = l, 
show that the second virial coefficient is 


B a B,,X? + 2B,2X1X2 + Ba3Xs. 
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4.3 MAYER CLUSTER EXPANSION 


From Eq. (4.2), we know that the classical partition function for an N-particle 
interacting gas 1s 


1 (mkT\3X/2 
-pF _ + 
: N! oe 2n ene) 
Zy = | en PV (Ru Ra Rx) dR d3R, +++ d3Ry. (4.37) 


The equation of state can be written, as in Eq. (4.12), as 


Stee hae, (4.38) 
RT VV 

This is the virial expansion; as before, B is the second virial coefficient, C the 
third, and so on. 

This formal expansion can, in principle, give all the virial coefficients. It 
was originally hoped that this expansion would yield the triple point, and 
related phenomena. This was not to be. For reasons to be discussed later, the 
formal program, which we now review, has not proved practical. 

In the following work, it is implicitly recognized that the /th virial coefficient 
is due to /-fold clusters. 


V(R,, R2,..., Ry) can be written (we assume) as }) V(r;,). 


Let 
Wy(Ri, Ro,... Ry) = exp[—-B Y V(ri,)].- (4.40) 
If there is one particle, then 
W,(R;) = e” F(9) = 1. (4.41) 


For two particles, 
W,(R;, Rz) = e PVC) 
W3(R;, Ro, R3) = exp {—B[V(r12) + Virz3) + V(ris)]}, ete. (4.42) 
Let 
W,(Ry) = U, (Rj). (4.43) 
If there are N points characterized by W,(R,,..., Ry) and if points 1 to M 


are far from points M+ 1 to N, Wy(R,,..., Ry) © Wr(R,,..-, Ry) X 
Wy—m(Ru+i,---» Ry). For example, if N = 2 and R, is far from R,, 
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W(Ry, Ra) © U,(R,)U,(R,). If the points are close to each other, this approx- 
imate equation must be corrected by the addition of a quantity, called U,(R,, R>), 
that is large only when R, and R, are close. 

Thus W,(R,, R,) can be written as 


W(R,, Ro) = UCR) U,(R2) + U2(R,, Rp). (4.44) 
Similarly, W;(R,, R,, R,) can be written as 
W3(R,, Ro, R3) = U(R,)U(R2)U(R3) 
+ [U,(R;, R2)U,(R3) + U2(R2, R3)U4(R;) 
+ U,(R,, R3)U,(R2)] 
+ U,(R,, R2, R3). (4.45) 


U;(R,, R2, R;) is the contribution due to all three particles close to one 
another; U;(R,, R,, R3) ~ 0 unless the particles are “clustered.” 


Wy(R,, R>,..-, Ry) = U(R,)... U(Ry) + °° + Un(Ry, Ra,---, Ry). (4.46) 


There is another way of expressing the above equations. If we set U,(R;, Rj) = 
fi;, then e~ 9°") = 1 + f,, and, from Eq. (4.40), 


Wy(Ry, Roy... Ry) = T] A + f))- (4.47) 


i<j 
For example, if N = 3, 
W3(R,, Ro, R3) = 14+ fio + fis + fos + Siahfe3 + Sisho3 
+ fiohis + fiofoeshis- 


We can express each individual term in this sum as a graph consisting of 
points and lines. If N = 3, then 


3 3 3 3 
lo ae Iie — = Si2ho3 baa Si2So3hi3 aad PAG 
1 2 1 2 1 2 1 2 


If G represents a graph, and W(G) is the integral over all space of its cor- 
responding function, then 


Zn = X W(G), (4.48) 


where the sum is taken over all possible graphs with N points. For disconnected 
graphs, such as 
Se 3 
1 2 
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W(G) is a product of its disconnected parts. A cluster of N points is represented 
by a sum ofall possible graphs in which those points are connected. For example: 


3 3 3 


3 
U3(R,, Ro, R3) = ak + J/\ ate o or TX. 


12 1 2 2 tt 2 


The Uy can also be expressed in terms of the W’s. From Eq. (4.43) U,(R,) = 
W,(R,). From Eq. (4.43) and (4.44), 


U,(R,, R2) a W(Ri, R3) = W,(R,)W,(R2). (4.49) 
From Eqs. (4.45) and (4.49), | 
U3(R,, R,, R3) = W3(Ri, R,, R3) = > Wi(R,, R,)W,(R3) 
+ 2W,(R,)W2(R2)W3(Rz), (4.50) 
where 
Y W(R1, R2)W,(R3) = W2(Ri, R2)W,(R3) + W2(R2, R3)W,(R1) 
+ W,(R;, R3)W,(R2). 
In general, 
UR’) = U(R,,..., R)W(R,,...,R) — > W(R;, R,)W,-2( Re selng depuae Or 
+ (-1)*0 — 1)! W(R,)- ++ W, (RD, (4.51) 
Here the coefficient for a term with k groups is (—1)*~'(k — 1)!. 


Thus, from Eq. (4.40), all the Wy are known. From Eq. (4.51), U(R’) in 
terms of the W; is given; thus the U,(R') can be found. (4.52) 


In any particular partition of the N particles, there will be m, clusters of 
size /; and of course >°; Im, = N. 
For example (as is evident from Eq. (4.45)), a three-particle gas can be 
partitioned into 
1. 1 cluster of 3, m; = 1,m, = m, = 0. 
2. 3 clusters of 1, m, = 3,m, = m, = 0. 
3. 1 cluster of 2 and 1 cluster of 1 (this can happen 3 ways), m, = m, = 1, 
M3 = 0. 
From the above discussion, with Eqs. (4.45) and (4.46), 
W,(R*) = Som) I] U,(R’) > lm, = N. (4.53) 


Sim) Means the sum over all possible partitions of N, that is, all divisions of 
particles into groups. 
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_ From Eggs. (4.37), (4.39), and (4.40), 


Z W, 


Now construct the expression 


1 
b, = at UR’) dR, pe dR,. (4.55) 
This equation is known as Mayer’s cluster integral. From Eqs. (4.51), (4.53), and 
the statement labeled (4.52), 


1 
Xwy = N! S (nz) I] | UR’) 


1 sa 
= N! Son) I] VII b;) e 


But if N = 5, for example, U(R,R,)U(R;, R,)U(R5) gives the same contri- 
bution as U(R,R;)U(R,R;) U(R,), which is a different partition with the same 
set of m,. For each possible set of m, there are several terms, all of which are 
equal. To see how many such terms there are, note that there are N! permuta- 
tions of the N coordinates, and each such permutation corresponds to a term 
iN S(m,) With the same set of m,. But we are counting each term more than once. 
We should not count different permutations of the coordinates within a given U, 
for example, U(R,, R,) = U(R2, R,). We also do not count changes of order 
of the U’s as separate contributions. For example, U(R,, R,)U(R3, R,)U(Rs;) = 
U(R3, R,)U(R,, R,)U(R;) are not counted separately. Thus for a given set of 
m,, there are N!/[], (/!)"'m,! terms. So we can write 


sp Rs 2,8 (UE by ny 


N! N! all possible sets “1 (I ty"'m,! 
of mi: <XIlm, = N 


Xw 


(aan 


- (4.56) 
{mi} 1 m,! 


It is important to note that (for large volume), b, is independent of volume. 
That this is so can be seen as follows: U,(R') vanishes unless the particles are 
clustered. Holding dR, fixed and varying dR, ..., dR, then gives some number 
completely independent of volume, because unless R2,...,R, are near R,, 
U(R) = 0. 

The integral over dR, then gives a quantity proportional to the volume, V, 
and as b, was defined with a V, in the denominator, b, is independent of V. 

We must now evaluate Xy = Sem, [i (Vb,)™/m,! subject to}. ml = N. 
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Recalling a similar summation encountered earlier (for the quantum 
statistics of a many-particle system), we note that if the restriction }' m,/ = N 
could be removed, Xy can be easily evaluated. Using the same methods as before 


we define 
achis _ zs m 3N/2 Zy 
N! \2nh7B 
and | 
@ Pe Se PEN eo Ay (4.57) 
N N 
where 


en me _ eB 
2nh? B 


But first taking the sum }°(,,) with >’ m,/ = N and then summing over all N is 
clearly equivalent to summing over all m, with no restriction. 


m,,,lm 
a A el (no restrictions) 
m l m,! 
(Vb,)™0™ (Vby)"07" 
my,m2,° °° m,! m,! 
= exp ly Y an 
g = —kTV > a’b). (4.58) 
i 


This is known as Mayer’s cluster expansion. 
From Eq. (1.51) 


4) 
(y= -3 
Ou 
= +kTV ¥ lao’! “ b, = V ¥ lod, 
I Ou I 
Therefore the equation 
p=<NY/V = )) lab, (4.59) 


determines « if we know the density. 
From Eq. (1.53) the pressure is given by 
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Therefore we get the equation of state 


PV PV _ bp! 


RT NkT >, 1b,’ 


For small density, « is small—as can be seen from Eq. (4.59). Then a’t! « 
a'; so that expansion (Eq. (4.60)) need only be carried out a few terms. For 
small /, b, can be calculated easily. Inverting Eq. (4.59) to get « as a power 
series in 1/V, and writing 


(4.60) 


PV B 
—~=1+—+—4+°, 
RT V V 
we have 
B= - 2 N —— ale = e7 BB(ri2)) dR, (4.61) 
1 
c a Abs is — ap? — XN {2 _ g~AV(r12) _ g~BV(r23) 9 ~BV(r13) 
bi 3 
a Cpr a, dR3. (4.62) 


When the density becomes large, the Mayer cluster expansion is of little use. 
This is because at very high density (liquid), the most important term in the 
expansion becomes the Nth term. That is, almost every particle becomes a 
member of a very large cluster— and in a liquid all the particles are in an N-fold 
cluster. 


Hard Sphere Gas 


The question may be asked: What is the partition function of a hard-sphere 
gas—that is, a gas composed of small inpenetrable spheres that exert no forces 
on one another when not touching? Unfortunately, this question is still un- 
answered. The question may be stated in a different way. 


ZN a 1 aR: 


no two closer than 2a. 
a=radius of sphere 


But 
VN = AaNR. 
no restriction 


Thus { d"R/{ dR is essentially the probability that if N points are “dropped” 
into a volume V, no two particles will be closer than 2a. 
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4.4 RADIAL DISTRIBUTION FUNCTION 


Consider a system of N particles interacting with pair-wise forces. The partition 
function Zy 1s 


T= | exp k BY: v| d?NR, (4.63) 


pair 


The probability density for particle 1 being at R,, particle 2 at R, etc., is 


1 
ts exp [—BV(R,, re) Ry)], 
Zn 
where the factor of 1/Zy provides normalization. The probability that a given 
particle, say particle 1, is at R, and the rest of the particles are anywhere is 


(=) [exw [—BV (R,, rey Ry)| d°R, d°R;, si sar d*Ry. 
N 


Because there are N particles, the probability density for one of these particles 
being at R is 


N 
n,(R) = Zz. [ew [— BV (R,, R,, sey Ry) | d>R, pace d>Ry 


N 


N 
= — exp | -6 y ver) | d°R, eal d°Ry; (4.64) 
Zn pair 
n, 1s the one-particle density, or distribution function. 

The two-particle distribution function ,(R,, R,) is defined as 


pair 


n,(R,, R,) = a |e |p y v| d°R,°° -d°Ry; (4.65) 
N 


nz is the probability density for finding a particle at R, and another particle at 
R,. For a liquid or a gas, n,(R,, R,) is a function of the distance between the two 
particles. 


n(R,, R2) = na(r 12). 


The general shape of n,(r) is as shown in Fig. 4.5. When r > o0, n,(r) 
approaches nj where n, is the one-particle density; this is because when the 
distance becomes large, the existence of one atom does not influence the distribu- 
tion of the other atom. 

The radial distribution function n,(r) is related to x-ray or neutron diffrac- 
tion as shown in Fig. 4.6. The x-ray is coming in along the z direction and is 
scattered by atoms. The scattered x-ray is observed at P somewhere far from the 
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nr ) 


Fig. 4.5 Two-particle distribution function. 


Z | 
Incoming x-ray 
wave vector K;, R; 


\ ie unit vector 


Outgoing wave 
wave vector Kouz= Kk ip 


Q 


C, reference center 
somewhere in the liquid. 


We can define Z = 0 at C. P 


Fig. 4.6 X-ray diffractions. 


liquid. The amplitude observed at P is the sum of amplitudes scattered by each 
atom, the latter being of the form 


a(0)(e**/R). 


Here R is the distance from the scattering atom and a(@) indicates the possible 
angular dependence of scattering, which is not important here. Taking into 
account the phase of the incoming wave, e“’*, we write 


Amplitude = a(6) >) (e™*/Fe™*4/Rjp). (4.66) 
J 


P is the point of observation. Then, approximately, 


Rjp = Rep — R;° bp 


and we may write 
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So Eq. (4.66) becomes 


ikRcp ; : 
Amplitude = a(0) e pa e7 Rout’ RigiKin: Rj 
cp J 
or 
eikRcp 
Amplitude = a() [— - G, (4.67) 
with 
G= > exp Li(Kin ° R; _ Kou ' R,)] = 2 exp (iq , R;), 

J J 

where 


q = K,, — Kou. 
The intensity of the x rays is 
Intensity = |Amplitude|? = (\a(@)|?/R2p)|G|?. (4.68) 
To compute the intensity, we need to know the expectation value of |G|?._ Now 
. iq: (Ri—Rj,) 
|IG/? = oe ied 
For a liquid, neglecting the i = j terms in the double sum, we have 
{|G|2> = NO 1) om je [-B Pes staal PR (4.69) 
N 


because the term in brackets, [ ]|, is the probability for the configuration R,, 
R,,..., Ry. Now, using Eq. (4.65), we write 


{iGI?) = 4 | eit (Bi-Ran (R,, Ry) d?Ry d?Rp, (4.70) 


When n,(R,, R,) depends on |R,; — R,| only, V = the volume of the liquid, 
and q = |q|, 


<|G|?> = 4V [et¥nscxp d°X = ane: | mat sin gr dr. (4.71) 


45 THERMODYNAMIC FUNCTIONS 


In the classical case, if we know n,(r), we can derive all the thermodynamic 
functions. 
The internal energy is 


1 Oy 


U = 3RT — ‘ 
Zy Of 


(4.72) 
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where Zy is the partition function defined in Eq. (4.2), now 


A OZN os, oe oo exp[—B > V] d2*R 


Zy OB Zy 
_ [ Voras) exp [-B EV] gap... 
Zn 
. _NW-) V(r ) xPL-A2XV] d3"R 
> 12 Zn ° 


Here, we use Eq. (4.65) to arrive at 


1 @Zy 1 
a = Vien d°R, d°R 
Zn ap | (T;2) 2(7 42) 1 2 
or 
U = = RT + “ V(r)n,(r)4nr? dr. (4.73) 


This equation could have been expected. The first term of Eq. (4.73) is the 
kinetic energy, and the second term is the potential energy. The latter comes from 


the pair potential V(r), and the factor 1/2 appears because each pair is counted 
twice. 


The pressure is expressed as follows: 


pV = RT — “e R-VV(R)n,(R) d3R (4.74a) 
or 
Vol ; 2 
pV = RT - mee rV'(r)n,(r)4ar* dr. (4.74b) 


These are classical formulae, which do not hold for liquid helium. Equation 
(4.74b) is derived as follows. From Eq. (2.121) 


ij 
i<j 
As in Eq. (4.73), (KE) = (3/2)RT. The second term of Eq. (4.75) can be 
written 


(z ri") = ” Cri '(r12)> 


—~ NN =D], ye erenatvendt 9) 
Zn 


= | raY Coane) d*(R, — R,). 
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Equation (4.74b) follows. Equation (4.74a) is nothing but the obvious 
generalization to the case of potentials that are not spherically symmetric. The 
product R:V V(R) in the integrand of Eq. (4.74a) is called the virial. 

When the force is homogenuous of degree (n + 1), 

V(r) = —cr’, 
so that 
rV'(r) = nCr" = nV(r) 


we can use these in Eqs. (4.73) and (4.74b) to see that 
3pV + nU = (3 + 3n)RT. (4.76) 
This relation is called the virial theorem. 
Problem: Show that at low densities 
nr) = 1 — eFV©), 


Using this result in Eq. (4.74), derive the virial coefficients in the virial expansion. 


4.66 THE BORN-GREEN EQUATION FOR n, 


In Section 4.5 we pointed out that one can find all thermodynamic functions 
given n,(r). In Section 4.1 of this chapter we showed how we could find (in 
principle) e~ °* given the potential between pairs of particles, and from F we can 
also get all thermodynamic functions. But to get e~ °* we must integrate over 
something like 107° variables. Computers cannot do such an integration; so we 
have got to find some way of getting the answer approximately. The Mayer 
cluster expansion was one way, and it was satisfactory for a gas. So far, for a 
liquid, we have replaced F by n,(r), which requires six fewer than the 107° 
integrations for F. Clearly, we still have quite a way to go. One approach 
toward approximately evaluating our monster integral is by means of the Born- 
Green equation. 


n,(R,, R,), defined in Eq. (4.65) can be transformed as follows: 


n,(R,, R2) = a? | ex| -p > v | d?R,:°:d>Ry 
N 


zs MD | ex |-# » VR) - BY ve) 
x i#1 yal 


x d3R,°°-d?Ry.’ 
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Differentiate the above equation to obtain 


Vin(R,, R.) = — ellie |] v7) os vd | 
i#2 


ZN 
xX eXp (-* y; v) d°R;:°°d?Ry. (4.77) 
pair 
When we define 
N! 
n,(R,, R,, R;) = ——————_ | exp] — V | d°?R,-++d?Ry, 4.78 
s(Ry, Ra, Rs) erat P| pxY oO Ry (478) 


we can write Eq. (4.77) as 


V,n(R,, R,) = — BV, V(R,2)n(R1, R,) 
_B | V,V(Ry3)n3(R,, Rp, R;)d2Rs. (4.79) 


This is the Born—Green equation.* 


The special feature of this equation is that bothn,(R,, R,) and 73(R,, R2, R3) 
are involved. When we go further and write the equation for n; we need n,. The 
equation for n, includes n;, and so on. Thus, we have a hierarchy of equations. 
This kind of hierarchy appears in other branches of physics—for example, in 
meson field theory—but no general, accurate way of treating the set of equations 
has been established. When we make a guess for n,, say, and cut off the hierarchy 
there and calculate back n,, we might get a good estimate of n,; but that an 
error in the guess of nm, becomes unimportant as we approach n, has not been 
proved. 

An ingenious way of closing the hierarchy is due to Kirkwood. When R; is 
very far from R, and R, 


n3(R,, R2, R3) > n2(Ri, Rp). 
Considerations such as this led Kirkwood to the approximation 


n(R,R,)n,(R,R;)n(R3R;) 


nj 


n3(R,, R2, R3) = (4.80) 


which is called the Kirkwood approximation,f or the superposition approximation. 
When we use Eq. (4.80) in Eq. (4.79) we can solve for np. 


* Born M. and Green H. S., Proc. Roy. Soc. (London) A188, 10 (1946). See also 
J. Yvon, Actualities Scientifique et Industrielles (Herman and Cie, Paris 1935), No. 203. 


+ Kirkwood J. G. and Boggs E. M., J. Chem. Phys., 10, 394 (1942). 
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4.7 ONE-DIMENSIONAL GAS 


Our attempts to evaluate the integrals involved in the thermodynamics of liquids 
and gases have not met with complete success. The Kirkwood approximation 
‘might not be bad, but we do not know exactly how to generalize it, or whether a 
generalization would give better results. What we want is a method that would 
give progressively more accurate results as we increased the computational labor. 
No such method is available today. 

We are faced with a problem that is too hard for us. We may be able to gain 
some insight by picking a soluble problem that has some of the features of the 
insoluble one. 

For example, consider gas particles constrained to move on a line (Fig. 4.7). 
First we assume only nearest-neighbor interaction. The partition function is 


Zy(L) = e” PIV (x1 — x2) tV (x2~ x3) + - ‘Idx, dx, aes dxn, (4.81) 
O0<x1<x2<°°+<xn<L 


The length L 1s fixed. 

We would be making a good start toward solving our problem if we could 
get a recursion relation with Zy,, expressed in terms of Z,. Since we cannot 
even do this, we will modify the problem in such a way that the macroscopic 
properties of the system are not changed but a recursion relation for the partition 
function can be found. We put one extra atom before x, and fix its position to be 
at y (see Fig. 4.8.) The resulting partition function we call Py(y, L). 


P,(y, L) za e BLIV(y— x1) +V (01 — x2) + es ‘Idx, dx, re dxy. (4.82) 


ypoxy<x2< °° <ZL 


In Eq. (4.82) y is fixed. Now, consider the case shown in Fig. 4.9. From Eq. 
(4.82) we can write 


Py i(z, L) = eT LV (Z—y) FV (y— x1) + a ‘Idy dx, rae dxy. (4.83) 


Z<y<xy~<0° °° <L 


0 x Xo X3 X4 Xn L 


$e 9» "=e 


Fig. 4.7 Gas particles constrained to move in one dimension only. 


0 y x} Xn L 


Fig. 4.8 One atom fixed at y. 
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0 Z y xy Xo Xn L 


}_ $99 


Fig. 4.9 One atom fixed at z. 


In the integral, z is fixed, but y varies. From Eq. (4.83) 
Py41(z, L) = ere dy 
z<y 
x e PIV x1) +V (x1 — x2) +: ‘Idx, ve dxy 


y<x1<° ° -<ZL 


Py(y, L)e8”@-» dy. (4.84) 
z<y<L 


Using Py(y, L), we can construct the grand partition function 


“AO = J efiNe-BFw = YY gbun 1 f_m NV"; 
é = ee = e ae . 
N N N! (sa75) ™ 


In our case, Zy/N! = Py(y, L) (the factor 1/N! comes from the fact that we have 
in the integral for Py restricted the range to y < x, <-::: < L instead of 
allowing each particle to be anywhere in [0, L]). We can absorb the factor 
(m/2nh? B)N’? in the e*"" by adjusting py. Then 


e*9 = ¥ ef "PA y, L). (4.85a) 
N 


As in Section 1.1, g for a combination of several independent subsystems, 
each with the same p, is the sum of the g; for each subsystem. In Section 1.6 we 
defined F = g + nN, where N is the expected number of particles in the system. 
Clearly, F for a combination of independent subsystems is also a sum of the F’s 
for each system. 

If there is a sufficiently large number of particles, we can consider the line to 
be a combination of line segments that interact with each other very little. With 
such considerations, it is easy to see that F/V is a function only of the density 
(at a fixed temperature), and F can be written F = Vf(V/N). It is not hard to 
show that F of the above form is equivalent to the equation g = —pV. The 
proof is as follows: From Section 1.6: 


OF 


= ON 


; Dp 
7 av 
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Using F = Vf(V/N) we then have 
g+pV =F — UN + pV 


Because the “‘volume’”’ of the system is L — y, we can now write Eq. (4.85a) as 


efP(L-y) — S ePHNP(y, L). (4.85b) 


N 
Multiply Eq. (4.84) by e%**#? and sum over N to obtain 


Zz 


L 
eP(L—z)B — ebb | eP(L—y)Bo- BV(z—y) dy, (4.86) 
which leads to 
L 
a ee | e~ P28 BVE-Y) dy, 
Zz 
or 
0c 
e be — | e” PPXe~ BY(™) dy, (4.87) 
10) 
The integral is insensitive to the upper limit, so we have replaced that limit with 


oo. This is the desired result from which we now can find all the thermodynamic 
quantities. For example 


Op _ 7 OP 
Ou Ou Ou Ou 


Equation (4.87) can be written‘as 


PN icc OO a CPU) NO ef CP 


0 = pu + log | ° er Pe- PV) dx = fly, B, plu, B)]. 


0 


From 0 = df/dp, it follows that 


Ou Of/Op Je xe PP*e7 BV)” 
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Problem: When the nearest- and next-nearest-neighbor interactions are in- 
cluded, derive the chemical potential. (Hold two atoms fixed near the left end 
of the line and derive the recursion formula. An integral equation results.) 

We see now that we can solve the problem of particles fixed on a line if the 
potential acts only between the nearest few neighbors. But what if all the atoms 
in the system interact? For a certain special form of the potential we can still 
solve the problem. Section 4.8 discusses the case of an e~ *! potential. 


4.8 ONE-DIMENSIONAL GAS WITH POTENTIAL OF THE FORM e7!+! 


We will now discuss the problem of a one-dimensional gas, all particles of which 
interact repulsively with all others with a potential V(x) = e~ "I. 

When there is attraction and no repulsion, the preferred configuration 
occurs when the particles all sit on top of one another. The potential V is pro- 
portional to the number of pairs, N(N — 1)/2. The more pairs there are, the 
lower the potential can get, and ultimately all particles collapse to a point. This 
result does not tell us much, so we assume a repulsive force. 

Assume N particles are distributed in a line of length L (Fig. 4.10). We wish 
to evaluate 


L 
1 fm \ ae 
—BFn = ___. ex = e [xi x,| dx oat ax 
omen (aes) |\[fp ore en 
0) 


= ( hh yo exp |-F oy gonad dx, ++: dxy 


2nh? Bp <j 
0<x,< <xn<L 
=( exp —B e“'(e *2 foo+e 4 @ ¥) 
4) 
ie O<x1<°*°<L 


+ »» eri *H) dx, eee dxy. 


j>iz2 
(4.87) 


Fig. 4.10 One-dimensional gas. 
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Suppose, as before, we modify the problem slightly by placing an extra 
atom before x,, and fix the position of that atom at y. Let « = e”. Later we will 
set y equal to zero. The partition function for this system is 


I,(a, L) = | exp \-é jae +e 2 +-+++e *%) 


O<y<x1<-°°-<L 


j>iz= 


4 y a | dx, ++: dx. (4.88) 


Then 


God) oe -6 jae peeing) 


O0<y<xo<x1< coe cc] 
+ > eli —*s) dXo eee dxn 
jrizo 


exp (— Bae *°) 


O<y<xo<-°::<L 


xX exp "8 | + e *)\(e7*1 4 +++ + eo *%) 
+ Y a | dX +++ dxy 


j>i21 


L 
| exp (— Bae *°)Iy(a + e*°, L — y) dxo. (4.89) 
y 


As in Section 4.7, we can again modify the problem in a manner that does not 
affect the macroscopic properties. We allow N to vary, put in a chemical 
potential, and define 


e-F9 = SY Ina, Leh, 
N 


Because Iy(«, L) is the partition function for a system with one particle fixed 
at y = log a, its length is L — log a, andg = —p(L — log a). 

Multiply Eq. (4.89) by e*+"#*, sum, set y = 0 and replace L by oo. The 
result after a little algebra is 


ee — \. exp {—fl[e*° + p log (1 + e*)]} dxpo. (4.90) 
0 


From Eq. (4.90) we can obtain all the thermodynamic properties, and we can 
reasonably assume that the atom fixed at zero does not change those properties. 
But let us see what we can do without fixing an atom at zero. The force on the 
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first atom, the one located at x,, is due to the rest of the atoms in the gas, and 
the effect of the other atoms can be described with a potential involving one 
extra parameter. Thus the first atom moves in a potential ce*! — 1, where 
c = >),;e “. Instead of specifying the position of one of the atoms as an extra 
parameter in the problem, we will specify c, which determines the potential felt 
by the atom at x,. We now calculate G,(c), which is the partition integral: 

L 


|e |p >. om dx, eee dxy 
i<j 
with the range of integration subject to the restrictions that 


ye ae 


and 0 < x,°°:< xy < L. Mathematically, 


Gy(c) = | exp | A > oe 6 (: -y “*) dx,°:: dxy. 
l<J 
O<xy<°+*+<xn<L 


Note that G,(c) = O if c < 0, and 


eFFn — (_™ le Gy(c) de. 
2nh? B 0 L 


To get a recursion relation we will hold x, constant until the rest of the integra- 
tions are done. 
Equation (4.91) can be rewritten 


Gy(c) = [ew | -# 2s e7 (xi *1) _ B Di aide 


i<j 


i 


x 0 (: —e *— >» “| dx,°+: dxy, (4.92a) 


where 


i 7 i#1 
Because of the action of the 6-function, the >"; e~* in the exponent of the inte- 
grand can be replaced by c — e'*!. To make the argument clearer substitute 


x, = X; — x,. Thus 


L L 
Gy(c) = | ie { ( Len ene *) 
L 0 ‘a 


xX exp |-# yet = sy yn dx,**:dxy. — (4.92b) 


i<j; 
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We know that 
) (: ie te ea)! ~*) = e'6 (« er — 1 — a —*) 


because 6(ax) = (1/a) 6(x). Equation (4.92b) can be rewritten 


Gyo) = | "dx, exp (1) exp [~A(ce™ — 1)] | ™ 5| ce sae a 
: 8) 


8) : l 


x exp E be ini dx,°+:dxy 


i<j 
L 
- \ dx, exp (x;) exp [—B(ce*! — 1)]Gy_, (ce* — 1). (4.93) 
) L-x1 
In general, the solution comes from Eq. (4.93), but it is very difficult to find. 
For subsequent use, let us define (for N particles) 


Ys aGy(c) = e®?*g(c) (4.94) 
N L 


a = eft 
and 
p = pressure. 


Combining Eqs. (4.93) and (4.94), we find that 


0 


g(c) = a \, dx exp (x) exp [—B(ce* — 1)]e~*?*g(ce* — 1), (4.95) 


where the substitutions x = x, and extension of the limit L — o have been 
made. In obtaining the eigenvalues of this integral equation, a relationship will 
be found between « and p; g(c) itself is not of interest. The boundary value on 
g(c) is g(c) = Oif c < 0. To put Eq. (4.95) in terms of differentials, substitute 


h(c) = e-*°g(c) 


g(c) = 2| dxe*e~ ®P*h(ce* — 1) 


0 

g'(c) = “| dxe**e~ ®P*h'(ce* — 1) 
0 

x 

c 


| exe” Bx “ [h(ce* — 1)]. 


0 
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Integrate by parts: 


g'(c) ~ A(ce* — l)e*e7 FPx|> aI h(ce* — 1)(1 — Bp)e*~®?* dx 
c Jo 


= up MOE) y= ~ i(e — 1) 


= oe = OOD AY Ge ee ge 1), (4.96) 


Equation (4.96) appears at first glance to be a differential equation, but is not, 
because g(c) and g(c — 1) enter into it. 
For another form, substitute 


g(c) = eF"/F(c) 
g'(c) = —Bee~P"l7Ff(c) + e FP? F'(c). 
Then Eq. (4.96) becomes 


cf'(c) — Be*f(c) — a(Bp — I) f(c) = —ae’l’f(c — 1). 


We can use the Laplace transformation 
ota) = |* espe de 
0 
to obtain an equation in g(q). Thus: 


|. e *cf'(c) dec 


= | ” en *F'(c) de 
dq Jo 


d 
= da [qe(q)] 


and 
Oe oe d? 
| er#e4f(0) de = F, 90) 
O 
Therefore 
d d? | B/2,-4 
— [qe(q)] + B—  o(@) + «(Bp — 1)e(q) = aeP'e 49(q). 
dq dq? 
Substituting 


o(q) = eT !**W(q), 


we eliminate the first derivative term: 


~py"(q) = fs a. ~ aefl2e-* + a(Bp — 1) + 1] v. (497) 
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If instead we use Fourier transforms 


9(o) = | =F (c) de 


and substitute 
Zaye 
W(w) = e°'* o(a), 
we get 


Bw'(@) = + webl2gi? 4 a(Bp — 1) + ») W(w). (4.98) 


The boundary conditions are not known definitely in this transform notation. 
However, it may be that the only restriction necessary is that the function 
W(q) or p(w) remain finite everywhere. We can now, in principle, get a functional 
relationship between « and p. 


4.9 BRIEF DISCUSSION OF CONDENSATION 


For a one-dimensional gas such as the one we have just considered with repulsive 
forces there is no condensation. Let us consider instead the case where there are 
both repulsion and attraction, such as the one depicted by the potential of Fig. 
4.11. At T = 0° all atoms are adjacent to each other in positions corresponding 


to V,,;, in Fig. 4.11: 


As the temperature is increased there is a continuous transition from the “‘solid”’ 


to an aggregate of groups of atoms: QOOO0O OQOO0O Cocco. The 
energy required for breaking up one group of atoms into two groups is roughly 


the energy required to break two atoms apart and we call it ¢. The probability 
of such a break is e~ */*, and when the temperature is raised it happens more or 


less uniformly. In three dimensions one might have a group of N atoms Bo 


Vinin 


Fig. 4.11 A potential with attraction and repulsion. 


126 Classical system of N particles 


The energy required to cut across a group ise = N7/? A, where A is roughly the 
value of the energy required for taking one atom away from the group. Thus, 
the probability for splitting up the group is e~%’”°4/S7, The larger N is, the 
higher T is required to be for a finite probability of breaking up a large group of 
atoms. Thus, in the one-dimensional case, a finite energy is required for breaking 
up a group of atoms, a finite number of groups exist, and the fraction of atoms 
(essentially infinite in number) in the condensed state (that is, in the largest 
group) is zero. In the three-dimensional case, because of the probability factor 
e NAIK 4 finite fraction of the total gas can exist in the condensed state at a 
finite temperature. 

For solids the difference is more subtle. In one dimension, as the temper- 
ature is raised, let the mean square error in the position of one atom relative to 
the ones on each side of it be 6%. Then, over a group of N atoms, the mean 
square error in positions of the first and last atoms is N6” from the perfect 
lattice configuration. The error “‘propagates’’ (in a sense), and when temperature 
is increased, the solid expands monotonically. There is only one path along 
which information can be transmitted. In three dimensions, the position of an 
atom in one place influences that of a distant atom much more strongly than in 
one dimension—there are many paths connecting the two along which forces 
are being transmitted. Thus to melt a solid T must become high enough that 
the thermal energy can overcome these “‘multipath’’ forces. 


CHAPTER 5 


ORDER-DISORDER THEORY 


5.1 INTRODUCTION 


We will now consider a new class of statistical-mechanical problems, problems 
that do not involve motions of atoms but instead involve their positions in a 
lattice. More precisely, we will consider alloys made up of two types of atoms. 
For instance, in the cubic lattice shown in Fig. 5.1, there will be some preferred 
arrangement for the manner in which the lattice sites are occupied by type A 
and type B atoms. Let us take the case of zero temperature. Essentially, two 
types of lattice arrangements are possible, one of which is depicted in the figure, 
and they can be understood from a physical point of view as follows. Let V4, be 
the interaction potential between unlike atoms and V,,, Vgg the potential 
between the like atoms. If V,, > (V4 + Vpp)/2, then A atoms prefer to be near 
A atoms and likewise for the B atoms. The solid would tend to exist in two 
distinct parts—one of A atoms only, and one of B atoms only. If Vy, < 
(Via + Vep)/2, the A and B atoms would tend to alternate in position throughout 
the solid. If the solid is heated, then gradually A and B atoms will exchange 
positions in a random way until, at the Curie temperature, the order “‘melts’’; at 
still higher temperatures, disorder sets in. It is of interest to compute the parti- 
tion function for such a system: 


e mph = >; exp = > B V; j}e 
arrangements (i,j,bonds) 


of atoms 


| | | 
~O-@-O- 
| | 
-O-O-O- 
| ! | 
~O-O-O- 
| | | 
| | | 
Fig. 5.1 Cubic lattice with two types of atoms, A and B. 
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Define: 
€é; = +1 for A atom 
= —1 for B atom 
N,, Ng = number of A, B atoms, respectively. 
Thus, 


ee; = +1 if AA pair 
+1 if BB pair 
= —1 if AB pair. 


Considering nearest-neighbor interactions only and the total N, + Ng as 
constant, we will write the interaction energy in the form 


Vi; = age; + (*5%) +c 


Then, 
Vig, =at+bt+e 
Vas ~A—-b+ Cc 
Vip = —-at+ec 
and 
ro ‘44 So Vip 


Thus, the exponential in the partition-function equation above is 
exp (fe >» et) exp (-#» aS) exp (— Bc). 
i,j i,j 


The last exponential, e~ °°, is a constant and does not enter into calculations. 
In the sum }'ponas (€; + &;)/2, each atom contributes to the sum a certain 
number of times. The number of times an atom contributes is equal to the 
number of bonds it makes, that is, the number of nearest neighbors it has. For 
example, in the case of a two-dimensional square lattice, each atom has four 


nearest neighbors; so 
a ee 2(N, — Nz). 
bonds Z 


In general, if N, and Nz, are kept constant, we can ignore >) (e; + &,)/2 because 
it does not depend on the arrangement of the atoms. The problem then is to 
calculate for the partition function the quantity 


¥ exp (- pay et) = Seer ( -H “ts | 
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But when we sum quantities over all possible arrangements of the atoms, the 
restriction that N, — Ng be constant may cause difficulties. To relax this 
restriction, we put in the usual “‘chemical potential.” In other words, we change 
the problem (but not the results of the solution of the problem) by saying that 
N, — Ng can vary. Associated with variation of N, — Ns; is a proportionate 
amount of chemical potential, which has the effect of making a certain ratio 
N,/Nz the most probable. In such a case, we cannot ignore 


6 tb; _ y 


bonds 2 atoms 


Our problem reduces to the calculation of 


Lew (—# y ee, -J Y “) 
bonds atoms 


where, in summing over possible arrangements, we still take N, + Ng = N 
fixed but no longer fix N, and Nz, separately. If J = 0, we are obviously 
solving the problem of (N,) = (Nz). For practice, we will evaluate the above 
sum for the one-dimensional case. Then we will consider an approximate 
method that works for two and three dimensions, and finally we will find the 
sum exactly for two dimensions. The exact solution for three dimensions has not 
yet been found. 

It should be noted that the above model can be taken as a model of ferro- 
magnetism. We interpret ¢, = +1 asspinupande, = —1asspindown, where 
the atoms are taken to have half-integer spin. A better model for ferromagnetism 
is to take the 


Hamiltonian = H ) S,-S, 
bonds 


where S; is the spin operator for the ith atom. This model is not equivalent to 
the model with ¢,¢; terms. For example, a two-atom system has 


S,°S, = aL(S; + S,)° = 3h], 


which has the value 44” with a priori probability three times as large as that for 
which the value of S, -.S, has value —3%?. That is, there are three ways in 
which two atoms of spin 4 can combine to give a system with total spin 1, but 
only one way in which they can combine to give spin 0. On the other hand, 
€,€;, takes on its two possible values with equal a priori probability. 

Although the two models are different, the qualitative results might be 
similar. For the rest of this chapter we will discuss the model in which the 
energy has terms é,¢,. This is the Ising model. 
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5.2 ORDER-DISORDER IN ONE-DIMENSION 


For practice, let us evaluate 


c= Y' exp(-H Yavin, - J Da 
1 


in one dimension. 
First, suppose J = 0. Then an easy way of doing the problem is to define a 


new variable: 
1 if ¢ = & 
- [ J 


1 if ¢ = —&.4, 


= 
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eee 
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(2 cosh H). 


So F = —N/f In (2 cosh A), and we have completed the calculation. 

If J # 0, the above method does not work. We have to use what is, by 
now, a familiar trick. That is, we make up a new problem that, for large N, is 
almost the same as the old one, but for which a recursion relation can be found. 
We will try to evaluate the partition function for a system in which the particle 
on the end has a fixed e. 


Let 
XyEver = YY exp[—H(eye. +++ + eyty41) 
fe eeats 
—J(e, + & +°°* + Ey44)]. 
Then 
Xwy+i€ye4) = a X y(év+1) Xp [— Heys 1€v+2 — Jen +2]. 
EN+1 24 


The free energy for a large number of atoms should be proportional to the num- 
ber of atoms; therefore 


Xy+i(€w+2) = eP¥X n(n +2) 


where p is independent of N for large N. The free energy is then F = —pN. The 
recursion relation then becomes two simultaneous linear equations 


Pex (y= YY XyWe 2’ where y= +1. 
x=H+1 
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If we think of (Xy(1), Xy(— 1)) as an eigenvector and e™ as an eigenvalue of the 
above equation, then it is easy to see what to do next. To find eF it is necessary 


to solve 
pee ais eg H-J _ fu Hts 
oH-J -H+J __ pbul’ 


ef! = eH eoosh J + e8/1 + e7* sinh? J. 


The result is 


There are two solutions here; which one do we want? If we look back at the 
case of J = 0, it is obvious that e’" becomes 2 cosh H = e~” + e”, so the 
general result is 


of! — eH oosh J + ef/1 + e744 sinh? J. 


Problem: Try to get the free energy as a function of the ratio between the 
number of A’s and the number of B’s. 


5.3 APPROXIMATE METHODS FOR TWO DIMENSIONS 


You may have noticed that in Section 5.2, none of the equations were considered 
important enough to number. This is because we were not especially interested 
in the answers, but instead wanted to examine some possible ways of looking at 
problems. Perhaps you will be able to improve upon and use some of the 
methods described here, or maybe you will find these methods useful for solving 
a completely different problem. In this section we will nibble at the edges of 
another problem: that of order—disorder for a two-dimensional square lattice. 
For simplicity, let us take J = 0. 

One approach is to find a recursion relation. To do this, find the partition 
function for the case where all the e’s in the last column are fixed. Suppose 
there are M columns of N particles each, and let Xy4(é, &2,..., &y) be the parti- 
tion function. As in the one-dimensional case, the free energy is proportional 
to the number of atoms. It follows that 


N 
ePYNX (Er, -- +5 €n) =X moil€y,---, En) 


= =. ane {—H[e,9, + 620, + +++ + eyOy]} 


x exp {—H[ee. + °°* + ey_yEw]}Xuy(O,,..., Oy). 
(5.1) 


The analogous equation in Section 5.2 required for its solution the diagonaliza- 
tion of a two-by-two matrix. The above equation requires the diagonalization 
of a 2%-by-2” matrix. 
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Given the matrix equation MA = 1A with M hermitian, a well-known way 
for finding the solution with the minimum or maximum 4 is to try to find the 
extremum of A - M4A/A- A. This method might be applicable to Eq. (5.1). Let 


X y(E1, ee ey Ey) = exp {—4H[e,6, giant Ey —1En |$Z(e,, oe Ey): 
Then Eq. (5.1) becomes 


ermNZ(E1,--->€v) = DY exp {—4H[ee. + +++ + ey_yey]} 
) ) 


x exp {—4H[0,0, +--+ + Oy- Oy} 
x exp {H[e,0; +--+: + eyOy]$Z(O,,..., Oy). 


We made this substitution in order to get an eigenvalue equation with a sym- 
metric real matrix, that is, a hermitian matrix. Then, to solve this equation, we 
must find the extremum of J/D, where 


I — > Z(6,, cc eg On )Z(E; a En) exp {—4H[e,8, + o + Ey — 18 ]} 


05,8: 
x exp {—3H[9,0, + -** + Oy_,On]} exp {—H[e,9, oe éyOy }}. 
D= 2» |Z(é,, ae Ey) I. 


Of course, it is not likely that you will be able to get an exact extremum of 
I/D. But it should be possible to get an approximate one by cleverly choosing an 
appropriate function Z(é,,..., &y). To be more systematic, one might choose 
a class of functions Z,(é,,..., &y) and maximize J/D with respect to «. ef#% 
should be equal to the maximum possible value of 7/D. 


Problem: Find as large a value of //D as you can. In evaluating certain sums, 
try setting 0y = 0, and éy = &,; it might simplify matters if the two-dimensional 
lattice were wrapped around a cylinder. 

We have been considering interactions such that when the temperature is 
absolute zero, atoms of type A and of type B alternate in the lattice. For the 
case of two or three dimensions, if the temperature is low enough, the crystal has 
long-range order. If we know what type of atoms are found at each lattice 
point in one region of the crystal, we know what type of atoms are likely to be 
found fairly far off. Note that in one dimension one wrong atom at a lattice 
point reverses the expected order of atoms on the other side of the defect. 
Thus, in two or more dimensions, we can speak of « sites and f sites, where 
a sites are those lattice points at which an A is more likely to be found. In one 
dimension we cannot do this. For the case of two dimensions, Bethe used the 
concept of « and f sites to describe order-disorder phenomena approximately.* 


* Bethe, H. “Statistical Theory of Superlattices,’’ Proc. Roy. Soc., 150, 552 (1935). 
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Fig. 5.2 Two-dimensional lattice with « sites and sites. 


Following Bethe, we will work with only a few atoms of the lattice and will 
try to describe the effect of the rest of the atoms by one parameter. We will look 
closely at only the atoms pictured within the dotted line (region I) of Fig. 5.2. 

The relative probability of a given distribution of atoms in the entire crystal 
is exp (—H Sbonas £:€;) (We are assuming there are just as many A atoms as 
B atoms). If we consider only the five atoms in region I we have to allow for the 
fact that the four outer f sites are affected by region II. We do this approximately 
by saying that the relative probability of a given distribution of the five atoms is 
e" exp (— H >! ¢;¢;) where n is the number of f sites in region I that have the 
wrong type of atom (A) on them. The sum is only over the four bonds in 
region I. ¢ can be thought of as 


__ a priori probability that a B site in region I has an A atom 
a priori probability that a f site in region I has a B atom 


By “‘a priori probability” I mean the probability that would be assumed before 
the thermodynamic effects of the interaction are taken into account. That is, 
even if there were no interactions between the atoms of region I, the interaction 
of the four external atoms of region I with region II would still tend to cause the 
B sites to be occupied by B atoms. Notice that n is not the total number of sites 
of region I that have the wrong atom. Ifthe « site in the center has a B atom, we 
do not multiply the relative probability by another ¢ because region II does not 
interact directly with that atom. Note also that the atoms inside region I can 
affect each other by way of region II, so ¢ is only an approximate way of describ- 
ing the effect of region II. 

For a given value of ¢ we can compute w, the relative probability that the « 
site in region I is occupied by a B atom. We can also compute w’, the relative 
probability that a given f site of region I is occupied by an A atom. We require 
that w = w’, because the probability of a wrong atom occupying a given site is 
independent of the site. The equation w = w’ will allow us to fix e. 
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Some of the results of Bethe’s method will be derived below. 


w= -5 (relative probability that the a site in region I is occupied by a B atom 
and there are n B sites occupied by A atoms). 


4\ _ 4! 
(") —nt(4—n)! 


ways in which nf sites may be occupied by A atoms. It is easy to see that 


» ee; = 4 — 2n. 


There are 


bonds 
Then 
4 4 4 4 
w= Y ( ) eterno a e tH > ( ) ecrny as e 441 4 ge2H)4 
n=0 \Nl n=0 \n 
= (e°" + eel)*. (5.2) 


w is a relative, not an actual, probability. The actual probability that the « site 
has a B atom is w multiplied by the appropriate normalization factor. 

Let r be the relative probability that the « site has an A atom. The equation 
for r is the same as that for w, except Ho —H. 


r = (e# + ee *)4, (5.3) 


The normalization factor is 1/(w + r); so the actual probability that the site has 
a B atom is 
(e-F + seM)* 
(ce? + eel)* + (eF + ce Ht 
w’ oc % (expected number of f sites of region I with A atoms) 


4 
w’ = 4 Y n{[(relative probability that n B sites are occupied by A atoms and 


n=O the a site has a B atom) + (relative probability that n B sites are 
occupied by A atoms and the « site has an A atom) |] 


4 
=4£¥n n() 8 e"[e HG— 2) 4 gH(4-2n)) 
n= 


0) 
: H(4-2n) H(4-2n) 
= 2 (, : i): "Leo +e | 


= seh(e 4 + ge#)? + ce B(e# + ce ")?. (5.4) 
Equating w and w’ from Eqs. (5.2) and (5.4), we find 


(e-# + ee”)* = eet(e"# + ee#)? + ce M(e# + es 4). 
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SO 
(e~# + ge#)3(e-# + ce — ce#) = ee M(e# + ce ¥)°. 


Simplifying our equation for ¢, we get 


~. H 


ef + se? 


Notice that if ¢ satisfies Eq. (5.5), so does 1/e. This fact holds because nowhere 
in the mathematics of our derivation did we specify that « sites, rather than B 
sites, are the locations for which A atoms are more probable. 

Solving for e?# in terms of e, we find two solutions: either 


gis 


i — 21/3 ~1/3 
= 7B gi el- + & (5.6) 


or else ¢ = 1 and A is arbitrary (the latter solution is most easily seen from 
Eq. (5.5)). 

The minimum value of e'/? + ¢~*/3 is 2; so if H is too small (high temper- 
ature) we must take e = 1. From the definition of ¢, ¢ = 1 corresponds to 
complete disorder. We can expect discontinuities in certain thermodynamic 
quantities or their derivatives at the temperature for which complete disorder 
sets in. 

As an example of a thermodynamic calculation using the above model for 
order-disorder, let us calculate the energy. 

Suppose a bond is an A—B bond with probability p. Then with probability 
p a given bond contributes energy — H/f, and with probability (1 — p) the 
bond contributes + H/B. The expected value of the energy is 


E = (number of bonds) E Ge + (1 — p) Gal = 7 (1 — 2p) (5.7) 


where WN is the number of atoms. 


[ee"(e~* + ee”)? ] 
wt+r 


is the probability that a given B site of region I is occupied by an A atom given 
that the « site has a Batom. Except for the normalizing factor 1(w + r), this is 
one of the terms in w’. 


1 
w+r 


1— [ee "(e" + ce *)?] 
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is the probability that a given f site of region I is occupied by a B atom, given 
that the a site has an A atom. The probability that a given bond between an « 
and a f site is an A—B bond is therefore 


Hy ,-H H\3 —Hy_H —Hy3 
peo (e-" + ee”) +(t _ ee "(e" + ee r). (5.8) 
w+r w+r 


H/B is a constant independent of temperature. Using Eqs. (5.7) and (5.8) we can 
get C, = OE/0T. We get the entropy from the equation _ 


S(T) = S(T) — S(0) = | re (5.9) 


‘a 
0 

At T = oo, we can compute the entropy exactly from its definition. Recall that 
S = —k >, P, In P,, where P, is the probability of state n. There are 2” states 
of our N-atom system, and each has equal probability at infinite temperature. So 


S = —k(2*) € In 4 = Nk In 2. (5.10) 
Our approximate calculation, using Eq. (5.9) gives S = Nk(0.697) instead of 


Nk(0.693). 


5.4 THE ONSAGER PROBLEM 


Consider a two-dimensional square lattice of N atoms. Each atom is designated 
by +1 or —1. That is, the ith atom carries a number ¢;, which is + 1 or —1. The 
energy of interaction of nearest neighbors is + He;e,;in units of KT. A state of the 
system is denoted by giving the ¢; for each atom. The problem is to calculate 
the free energy of the system. 
The partition function 
— — PEK 
= e 
. 2 all da, 
—Z*Heje; __ * ~.— Heje 
= e j= Ii*e 4, 
2 all d all states 
where >“* and [[* mean the sum and product on pairs of points i and j that are 
adjacent on the lattice. It is easy to see that if the sign of H is changed, Q 
remains the same. For convenience we will write 
Q= YD res (5.11) 
all states 


and take H positive. 
Since €;é; = +1, 


e —ée — pt citjH 
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is obviously true. So 
eHeei — cosh H + ee; sinh H 
= cosh H(i + ¢e,T), (5.12) 
where 7 = tanh H. (Note that T does not represent temperature in this section.) 
From Eas. (5.11) and (5.12) we find 


Q = > J]* [cosh A(1 + ¢,¢;T)]. (5.13) 
Remembering that for an N atom lattice there are 2N bonds, we have 


OQ — cosh2% H y T1*(1 + e€;T ) = 2%Q’ cosh?" H. (5.14) 


all states 


We must now find the modified partition function 


g=+ ¥ Td + e8,7). (5.15) 


QN all states 


Expanding the product we use the notation )\f to indicate a sum over all possible 
sets of / different bonds. Then 


Wor= VY: YY {1 + Tree, + T?XZ (e¢)(e-e) + °°: }. (5.16) 
ey=t1 en= +1 
Note that in each term, the e’s occur in pairs corresponding to nearest 
neighbors, and no such pair occurs twice in the same product. 
Associate with each pair ¢,¢; a bond connecting the ith and jth atoms (i and j 
are nearest neighbors). With each term or product of 2/ ¢’s, we associate a graph 
or set of / bonds. Because no pair ¢,e; appears twice in a given term, no bond 


appears twice in a given graph. For example, Fig. 5.3 gives the graph associated 
with the term 


(€1&2)(€3€4)(E5&6)(E1&5)(E2&6)(E387)(E4€g)(E7€g)(E10&11)(E1 081.4): 


1 2 3 4 

5 6 7 8 : 
° 9° T "2 ~CO«*s 
13° «44 715 *6 ° 


Fig. 5.3. A graph or set of / bonds. 
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Fig. 5.4 Example of a closed graph 


Now, 


eé=0 and 
e=+1 e=+1 


M 
M 
I 
IS) 


(5.17) 


From Eggs. (5.16) and (5.17), it is clear that only those terms where each ¢; 
appears an even number of times will contribute to Q’. This is equivalent to 
saying that the only graphs that count (or contribute to Q’) are those in which 
each atom or lattice point has an even number (0, 2, or 4) of bonds emanating 
from it. In other words, the contributing graphs must be superpositions of 
simple closed polygons having no common sides. Such graphs may be called 
“closed graphs.” 

Figure 5.4 gives an example of a closed graph. Note that the graph of Fig. 
5.3 is not closed and does not contribute to Q’. Figure 5.5 is another example of 


Fig. 5.5 Example of an ““Unclosed”’ Graph. 
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a nonclosed graph, which contributes zero. Each term in Eq. (5.16) correspond- 
ing to a closed graph has each «; occuring to an even power (0, 2, or 4), and 
therefore equals simply 7”, where L is the number of bonds. Summing over all 
€, gives a factor 2", which is cancelled by the 2% on the left side of Eq. (5.16). 
Each closed graph of length L thus contributes a term 7” to Q’; so 


Q' = LoD)T", (5.18) 


where g(L) is the number of closed graphs of length L that can be drawn on the 
lattice. Notice that g(L) = 0 unless L is even, so the sign of 7 (and therefore 
the sign of #) is immaterial, as we expected. 


Method of Finding Transition Point 


We now digress for a moment to show a method (due to Kramers and Wannier) 
by which the transition point of the Onsager problem can be found. This method 
is not powerful enough to yield an analytic expression for the partition function. 

Consider a closed graph on the lattice. Label the midpoints of the lattice 
squares #; = +1 if inside the graph and pn; = —1 if outside. (See Fig. 5.6.) 

Consider a bond (———) between midpoints of unlike y values. Each of these 
bonds corresponds to a bond of the closed graph (because each —— bond cuts 
a — bond). | 

We now have two lattices. The new lattice is formed by putting “‘atoms”’ 
described by p; at the midpoint of the old lattice. For the new lattice we can 
discuss Q at a temperature 7’ and such that the interaction energy is A in units 
at kT’. Then 


Qh) = YY TW*emins, 


all states 
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Fig. 5.6 Midpoints of lattice squares labelled + inside the closed graph and — outside it. 
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Ifu; = +y;,thenyju; = +1, and there isa contribution e*”. Ify, = —y,, 
then the contribution is e~*. 


Now, if in a given state there are L unlike bonds and 2N — L like bonds, 
then its contribution to Q(h) is 


ge  Ehg(2N-L)h 
so that 
Q(h) = e2Nh » m(L)e~ 244 
L 


where m(L) is the number of ways by which unlike pairs can be arranged on the 
lattice. 

But from the preceding argument (concerning -—— and — bonds), m(L) = 
g(L), where g(L) is defined by Eq. (5.18). 

Thus 


Qh) = e?%* \" g(L)e*™, 
Q(H) = 2" cosh?" HY. g(L)T*. 
L 


Let us now define h by e~*" = T = tghH. Then: 
Q(h) = tanh-” H ¥ g(I)T' 
and we get the identity 
O(H) = 2% cosh2® H tanh” H Q(h). 


Now, when # is large, h is small; if QU) has a singular point, so does 
O(h). That is, if there is a transition at H,,, there is also a transition at h,,. If we 
assume that there is only one transition or Curie point, H,, = h,, and H,, = 
—41n (tanh H,,). This can be solved to give 


H,, = 4 log (1 — V2). 
You might hope that we could use the identity 
O[x] = 2% cosh?” x tanh” xQ[—} In tanh x] (5.19) 


to find Q(x) for many values of x given Q(x) for one value of x. Just plug in x on 
the left side of Eq. (5.19) and get Q evaluated at another value, 


x’ = —4 In tanh x. 
We can then get Q evaluated at 


x” = —41n tanh x’. 


Unfortunately, x” = x, so this method is not useful. 
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Continuation of Onsager Problem 

Call h(/) the number of ways that, starting from the atom (origin), we can 
proceed in / steps (/ bonds) through the lattice and return to the origin without 
using the same bond twice. In A(/) we do not count separately the two or more 
different paths we can use to form the same polygon. 

For carrying out the calculation we shall need the sum over all different / 
polygons of the crystal. Our h(/) polygons for atom A in Fig. 5.7, say, will be 
counted again in the A(/) polygons of other atoms, say B in Fig. 5.7. Each 
polygon will occur / times. Therefore, the number of polygons per atom is A(/)/7. 
Let 


q= y AD ps (5.20) 


We have 
Qe=YAaADT*'= Y TT, 
L closed 
diagrams 
but need an expression for Q’ that we can hope to evaluate. We will work up to 
such an expression by finding an approximation to Q’, which we will then correct 
by means of a trick. 

A term from a closed diagram can be thought of as a product of terms made 
up of graphs formed by polygons. The contribution to Q’ from no polygons is 
simply 1. The contribution from graphs consisting of one polygon is Nq, for the 
number of graphs consisting of one polygon of length / is NA(/)//. The contribu- 
tion from pairs of polygons, one of length / and the other of length /’ is (NA(J)/1) 
(NA(I')/1')T'*" so the contribution from pairs of polygons of any length is about 


=) 


oe NAG)\ pise 
2 Ll I’ ‘ 


Origin 1=8 


Fig. 5.7 A polygon formed by proceeding through the lattice back to the starting point, 
without using the same bond twice. 
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where the factor of 1/2 is to account for the fact that we count each pair twice. 
So the contribution from pairs is about 4N7q*. I say “about” because we have 


counted such graphs as 


even though that pair of squares is not “‘closed,”’ since one bond appears twice. 


Furthermore 


is a pair of squares and is also a single polygon of length 8, so we have counted 
it twice. If we continue to ignore such errors, we get 


Q'~ 1+ Nq + (Ng) + 4(Nq)? +-°: = eM. (5.21) 


Our next step logically should be to correct g in order to get Q’ right. We 
can correct g by correcting A(/). Let us write down a few of the correction terms. 
Because of the presence of polygon 5.8a 


tj 
Fig. 5.8a 


in the first-order term in Q, we obtained the forbidden graph of 5.8b 


an Fig. 5.8b 


in second order (i.e., as a product of two polygons). To cancel graph 5.8b, we 
insert graph 5.8b into A(8) with a negative sign. But then in Q’ we seem to get 


terms such as 
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in both second and third order. In second order, there are two terms that form 
a diagram of type 5.8c, and both have a negative sign: 


ILI 
| tL 


that is, there are two ways of writing 5.8c as a product of 5.8a and a 5.8b type 
term. In third order, one of the two second-order terms is canceled by the 
positive product of three type (a) terms. To cancel the other second order term 
of type (c), we must insert (c) with a positive sign into A(12). 

In Eq. (5.21), we counted polygon 5.8d twice; therefore when we get a rule 
for what terms belong in our corrected /(/) and what their signs should be, we 
will have to have an excuse for not counting (d) in the corrected A(/). 


and Fig. 5.8¢ 


Fig. 5.8d 


Now, let us look at where we stand. In evaluating A(/) we expected trouble 
because of the restriction to certain allowed polygons. In correcting A(/) we 
found that there must be other polygons with not necessarily positive contribu- 
tion, and some of the allowed polygons should not be counted. The natural 
thing to do is assign each polygon an appropriate weight, and not ignore any 
polygon in the sum. Recall that formerly we did not count separately the two 
or more different paths we can follow when starting our from an atom and 
forming a polygon. Now we will count all the different paths, but we will choose 
the weight of each path so that our corrected “‘q” will lead to the right value of 
QO’ when we use Eq. (5.21). 


A Topological Theorem 
Give a weight « = e'*/* for a left turn and a! = e~ **/* for a right turn, and 


keep track of the left and right turns as a closed path (returning to origin) is 
traversed in the lattice. The topological argument of Kac and Ward is that the 
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closed graphs (or ones that we want to count) will be counted, and the forbidden 
graphs (e.g., Fig. 5.5) will cancel out when different ways of traversing them are 
considered. That this theorem is true for simple cases will now be demonstrated 
by a few examples. The complete proof was given by Sherman* and is rather 
involved. 

Consider the simple closed polygon in Fig. 5.8a. 

Starting at A and returning to A in a clockwise direction we note that 4 
right turns were made giving a total weight of 


opr Ae ss (e7 in/4y4 = ae 
Proceeding in a counterclockwise way gives a weight of 
(a)* = (eit/4)4 = _], 
The total weight is thus —2, and the simple polygon is indeed counted. To get 


the correct weight of 1, we multiply everything by —1/2. 
Now, consider the polygons in Fig. 5.8e and e’: 


Poe ea je ae tea 
the 7 a “il 
(e) (e’) 
Fig. 5.8 


In Fig. 5.8(e), the path is traced out in such a way (starting from A) that there 
are 6 left turns and 2 right turns for a total weight of «°(«~7) = (a*) = —1. 
In Fig. 5.8(e’) the path is traced out in such a way that there are 4 left turns and 
4 right turns for a total weight of «*a~* = 1. The total weight then of the two 
clockwise paths is —1 + 1 = 0. It is clear that the two counterclockwise paths 
have weight 0 also. 

Similarly you should be able to show that 5.8(b), 5.8(c), and 5.8(d) and all 
simple polygons give the correct contribution (remember that everything is 
multiplied by — 1/2). 


Method of Calculating partition Function 
Let g denote our corrected value of g. Then 


2 > {corrected h(I)} Ti 


a 1=1 l 
and 
Q' = et — 2T Oe —2 ¥ {corrected h(I)}T’ 
dT l=1 


* Sherman, S., J. Math. Phys., 1, 202 (1960); 4, 1213 (1963). 
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which is the sum over all paths leaving and returning to the origin of a complex 
“amplitude” or weight for each path. This amplitude is the product of ampli- 
tudes for each lattice point passed: 


T if one continues forward: 
: oe in/4. 
Ta if one turns counterclockwise: « = e'*/*; 


Tau} if one turns clockwise: «1 = ei”/*; 


0 if one turns 180°. 


In our rule for determining the amplitude for each lattice point passed, we 
have neglected to say how to deal with the first step, whose amplitude depends 
on the last step. In what follows, we will calculate the amplitude to start at the 
origin and arrive back at the origin from below, that is, moving upward. Thus, 
we will choose as our rule for the amplitude of the first step: 


T if one starts upward; 


Ta if one starts to the left; 
Ta” | if one starts to the right; 
0 if one starts downward. 


More generally, we will use the above rule for the first step and work with 
amplitudes for starting at the origin and arriving at any lattice point (x, y). 
We shall have to describe the amplitude by giving four components: U(x, y), 
the amplitude to arrive at x, y moving upward; D(x, y), the amplitude to arrive 
moving downward; R(x, y), the amplitude to arrive from the right; and L(x, y), 
the amplitude to arrive from the left. This, as mentioned before, is because the 
amplitude to make the next step depends on the arrival direction, and we shall 
have to keep track of it. 

Roughly, what we will do is calculate the amplitudes for arriving at (x, y) 
from various directions in exactly n steps, and then sum over n. U,(x, y) is the 
amplitude to arrive at (x, y) in exactly m steps with the last step upward and 
similarly for D,, R,, and L,. We assume that in the zeroth step we arrive at the 
origin moving upward. Thus U(x, y) = 6,96), Do = Ro = Lo = 0. 

In other words, the amplitude to arrive in zero steps is one if we arrive 
upward at the origin and zero for any other point or any other direction of 
arrival. With these amplitudes for the zeroth step, we can get the amplitudes 
for any number of steps by recursion. We can write for the amplitude to arrive 


upward at (x, y) inv + 1 steps in terms of amplitudes to arrive from (x, y — 1) 
in n steps. 


U4 1(, y) a TU,(x, y _ 1) sa TaL,(x, y = 1) za Ta~*R,(x,y 1). 
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If we came to x, y — 1 going up, we have an additional amplitude factor T. 
If we came to x, y — 1 from the left, we need to turn counterclockwise and add 
a factor Ja, and so on. We cannot get to x, y going up if we previously entered 
x,y — 1 going down, for we do not allow paths that reverse themselves. 
Similar expressions for R,(x, y), L,(x, y), and D,(x, y) are easily written. 

Then we see that our assumption for the amplitude of the zeroth step will 
give, for the first step, the correct amplitudes (such as T if one starts upwards). 
When summing over n, we will eventually have to subtract out the n = 0 term, 
because we do not consider a figure with zero side to be a polygon, even though 
it starts and ends at the origin. 

To simplify the summation over n, we define the transform of U,(x, y) 


Une] > > viene re™. (5.22) 
x=—-0O y=-o 
Then 


dé dn 
(2)? 


Qn 2n 
Ge | | een (E, 9) 
0 r@) 


For example, the transform of U,(x, y — 1) is e~™u,(é, n), and the transform of 
U,(x, y) is 1. The transformed equation then reads 


Un41(6 0) = Te "U,(E,n) + 0-e-"D,(E, n) + Tae” L,(E, 1) 
+ Ta te~™R(E, n). (5.23) 


and similarly for D+ i(S, N), Lit i(S, N)s and Ri+ i(¢, n). 
If we let w, be a four-component symbol (column vector) with components 
(U,, D,, Ly, R,), We can write the matrix equation 


Wn+ i(¢, n) = TM (C, ACS n) (5.24) 
where M(é, 1) is the matrix 


ein 0 ae = gte-l 

| 0 ef” qi teit = geil 

M(é, 1) = Pa cs | ee Pa e~ ig 0 . (5.25) 
aes gq telé 0 els 


Wols, n) = (i, 0, 0, 0). 
Wi(C,n) = TMY 
VW oe TMy, ee (TM) Wo 


Vy, = (TMY" Wo. 
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Then 


co 2 _ 1 
y= > (TM) Yo = 1— TM _ T™ Wo 


is the transform of the amplitude to arrive in any number of steps. The ampli- 
tude to arrive at the origin moving upward 1s 


y U,(0, 0) = + Du U,(0, 0) = -1+ Wo: {. x ei60 pind 


1 dé dy 
x ee 
1—TM  (2n)? 


2x ('2n 1 7 dé dn 
\ \ (¥. 1_ TM ) (2m)? 


We could have arrived at the origin moving in any direction, so we must sum 
over all four values of Wo, that is wo = (1,0, 0,0), wo = (0, 1, 0,0), wo = 
(0, 0, 1, 0), Ww = (0, 0, 0, 1). The result is a trace. Thus, 


-27 4 aan {7 i Peer ea) dg di (5.26) 
1— TM (27)* 
ode dn 1 By 
aa \. c (2x)? tr] | - 2t(1 — tM) ag A o 
-(- |. dé dn ain —M it 
(Qn? 2(1 — tM) 


a . e a Tr [log (1 — TM)] 


aw | = . ae a log det (1 — TM), (5.27) 
(27) 


we have used the following theorem: 


Tr log A = log det A. 


To prove this identity, note that 


N 
lim (det e3/%)% = lim ae (: + y)| 


N-> © N-~o 


det e? 


| 
5 
, ie 
—_s 
+ 
zie 
4 
w& 
No 
lI 
® 
ar) 
by 


148 Order-disorder theory 


Therefore, log det e? = Tr B. Let B = log A. Then Tr log A = log det A. 
M is given by Eq. (5.25), and one obtains 


det [1 — TM] = (T? + 1)* — 2T(1 — T*)(cos E€ + cosy). (5.28) 
Summarizing, we see that 
BF/N = —In QO/N, 
Q = 2\(cosh?” H)Q’, 
Q’ =e 
q= iL oe log det (1 — TM), 


0 


det (1 — TM) = (T? + 1)? — 2T(1 — T?\(cos € + cos) 
T = tanh H. 
Putting all this together we obtain* 


BF 1 2x (2x dé dn . 
ee aly eo In [cosh? 2H — sinh 2H 
a ; | { On)? [ sin (cos € + cos n)| 


(5.29) 


5.55 MISCELLANEOUS COMMENTS 


In the above we sometimes assumed that the lattice was finite (with N points) 
and we sometimes assumed it was infinite (when we neglected the boundary for 
example). Because NV was assumed to be very large, it is plausible to suppose 
that our inconsistency made little difference. However, let us modify the above 
work to make it consistent. 

Assume the lattice has the topology of a torus. If we pass through L points 
in one direction (or L’ points in the perpendicular direction) we finish back 
where we started. Formerly, to compute the amplitude to arrive back at the 
origin moving upward, we took the sum 


y U,(0, 0). 


1 


n 


Now, we must take the sum 


i.) 


x Y ¥ UL, gL). 


p=—@ q=—o n=1 


* A slightly different treatment for the Onsager problem was given by N.V.Vdovichenko, 
Soviet Phys. JETP 20, 2, 477 (1965); also see Soviet Phys. JETP 21, 2, 350 (1965). 
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Continuing as before, we get in place of Eq. (5.29), 


BF ae 1 2n (2x dé dn y gipLé piaL'n 
N 2 0 0 (21)? Pq 


x In [cosh? 2H — sinh 2H(cos € + cos n)]. 


a> 5(¢-). 


p=-@ n>=>-—-© L 


BF_ ino 1 § § (22) (27)\_1 
is PPG lai 


x In cosh’ 2H — sinh 2H (co am + COs Falk (5.30) 


But 


So 


As L and L’ tend to infinity, Eq. (5.30) approaches Eq. (5.29). 

A few comments about the form of Eq. (5.29) are in order. If we use the 
expression 

BF = —log ¥ e ** 
States 

we can verify that Eq. (5.29) is correct for high temperatures (H — 0) and low 
temperatures (H — oo). Another check of Eq. (5.29) is by way of Eq. (5.19). If 
we consider Q as a function of y = sinh 2H instead of as a function of H, Eq. 
(5.19) can be written in an especially simple form. Because sinh 2H = 2 cosh H 
sinh H = 2 cosh’? H tanh H, 


O(y) = 2" cosh?’ H tanh” HO (5) = sinhY 2HO () = yo (|) _ (5.19a) 
| y y y 


Equation (5.29) can be written 


BF 17? ("de dp 1 
Pee i) ce ln i = In) ee 4: 
ny 4 Iny 2), |, Gm? y : (cos € + cos n) 


(5.29a) 


from which it can be seen that Eq. (5.19a) holds. 

Kac made the conjecture that the solution for the three-dimensional 
Onsager problem (cubic lattice), is of the same form as Eq. (5.29) except that 
cos € + cos n is replaced by cos € + cosy + cos ¢ and dé dy/(2n)” by dé dy 
dt/(2x)*. However, this conjecture is false. 

Instead let us conjecture that the specific linkage of the Ising lattice will not 
greatly affect the character of the transition point in the specific heat curve. 
Also, an approximation starting from the transition point should prove very 
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valuable. If a magnetic field is added to the Onsager problem, the problem of 
evaluating the partition function can again be converted to a path problem, 
which has not been solved. The problem here is to keep track of the area en- 
circled as well as the length of the path. 

We have done a lot of work just to evaluate the expression 


Q= Y exp (-# » et) ; 
eg= a1 bonds 

One reason for going to all this trouble is that maybe you will be able to general- 
ize the methods used, or apply them somewhere else. For example, we considered 
a Statistical-mechanical problem for which we eventually had to evaluate a sum 
over all paths leaving and returning to the origin, with an amplitude depending 
on the preceding step. This is essentially a Markovian walk, with each step 
depending only on the one previous step. A Markovian-walk problem, however, 
can be handled when each step depends on a specific finite number of preceding 
steps. In a way analogous to the method used above, Q, can be found for the case 
where the amplitude of a step depends on the 1 steps preceding it. The question 
is: Can one find an interesting problem whose partition function is Q,? 


CHAPTER 6 


CREATION AND ANNIHILATION OPERATORS 


6.1 A SIMPLE MATHEMATICAL PROBLEM 


In this chapter we shall describe an operator formalism that has widespread 
applications in quantum mechanics, notably in dealing with harmonic oscillators 
and in describing many-particle systems. 

We begin by formulating and solving the following simple problem: 
Suppose an operator a satisfies 


[a, a*] = 1. (6.1) 


The problem is to find the eigenvalues of the Hermitian operator a*a, and to 
relate the eigenvectors. (Note: a* denotes the Hermitian conjugate of a, and 
[A, B] is, of course, the commutator AB — BA.) 

We first note that, if |«) is a normalized eigenvector with 


a*a\ax)> = ala), (6.2) 
then 
a = <ala*ala) = lala) |? > 0. (6.3) 


That is, the eigenvalues are all real and nonnegative. Using the identity 
[AB, C] = A[B, C] + [A, C]B, we observe that 


[a*a, a] = [a*, ala = —a, (6.4) 
[a*a,a*] + a*[a,a*] =a’; (6.5) 
or, equivalently, 
(a*a)a = a(ata — 1), (6.4’) 
(a*ajat = a*(a*a + 1). (6.5’) 


From Eq. (6.4’) we have, for an eigenvector |«), 
(a*a)a\x) = a(a*a — 1)|a) = a(x — 1)|a> = (a — ala). (6.6) 


Therefore ala) is an eigenvector with eigenvalue « — 1, unless ala) = 0. 
Similarly a* |x) is an eigenvector with eigenvalue « + 1, unless a*|a) = 0. The 
norm of ala» is found from 


ale) |? = <ala*ala) = aXala> = a, 
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or 
ale) || = Va. (6.7) 
Similarly, 
a* |a) || = Vo + 1. (6.8) 


Now, suppose that a"|«) # 0 for all n. Then by repeated application of 
Eq. (6.6), a"|«» is an eigenvector of a* a with eigenvalue « — n. This contradicts 
Eq. (6.3), because a — n < 0 for sufficiently large n. Therefore we must have 


ala) #0 but a"*|a) = 0 (6.9) 


for some nonnegative integer n. 
Let |a — n>) = a"|n)/ \la"|n) ||, so that [a — n> is a normalized eigenvector 
with eigenvalue « — n. Then from Eqs. (6.7) and (6.9), 


Va —n= |lale — n)|| = 0, 


and therefore « = n. This shows that the eigenvalues of a*a must be nonnega- 
tive integers, and that there is a “ground state’”’ |0) such that 


al0) = 0. (6.10) 


_ By repeatedly applying a* to the ground state we see that (a*)"|0) has the 
eigenvalue n and, because of Eq. (6.8), it is never zero. Thus the eigenvalues of 
a*a are 0, 1, 2, 3,.... 

If |x) is a normalized eigenvector with eigenvalue n, then, from Eq. (6.8), 


In — 1) = (1/Wa)aln) 
is a normalized eigenvector with eigenvalue n — 1. Also 
a*|n—1) = (1/Vn)a* aln) = Jn|n). 


So applying a* to |n — 1) gives us back |n) (within a factor), rather than some 
other state with eigenvalue 7. 

We may then construct the eigenstates of a*a as follows: First we find a 
state |0) such that 


al0) = 0. (6.11) 


(|(0> may be unique; if not, we find other operators commuting with a and a”, 
and classify the |0)’s according to their eigenvalues.) Then we define 


zs 


/2 


I) = a*l0; [2) =—L atl) = 5 FO: 
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and in general 
1 
ln> = var (a*)"|0>. (6.12) 
n! 


(Note that we could have included arbitrary phase factors in the definition of 
\n); our convention here is to make them unity.) With this definition, the |) 
are orthonormal* and satisfy 


a*\n) = Jn + 1|[n + 1) (6.13) 
a\n) = Vn|n — 1) (6.14) 
a*a|n) = n\n). (6.15) 


Equations (6.11) through (6.15) form the answer to the problem posed at the 
beginning of this section. 

The operators a* and a are called “‘raising’’ and “lowering” operators, 
respectively, because they raise and lower the eigenvalue of a*a. In later 
applications a*a will be interpreted as the observable representing the number 
of particles of a certain kind, in which case a* and a are called “‘creation”’ and 
“annihilation” (destruction operators, or “emission” and “absorption” 


operators. Equations (6.13) and (6.14) may be alternatively expressed in terms 
of matrix elements: 


<mlat \n) = Vn + 1 Sn tts (6.13’) 
(mla|n) = Vn ban —t- (6.14’) 
* For, by (6.12) we have 
(nlm) = Ola"(aty"|0 >A /x/n!m)). 
From Eq. (6.1) we easily obtain 


[a,(a*)"] = n(aty"~*, 
so that 


(Ola"(a*y"|0) = (Ola"~*(aty"al0) + O|na"-!(a*y"- 110) 
n(Ola"~*(a*y"—"10) 
n(n — 1)---(n — m + 1)Ola"™"0) 


= Nn! Onm 


and the orthonormality follows. 
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6.2 THE LINEAR HARMONIC OSCILLATOR 


Our first application of the results of Section 6.1 will be to the one-dimensional 
harmonic oscillator, which has a Hamiltonian of the form 


2 
H = — p? + —— x3, (6.16) 


where x and p are the position and momentum operators for the particle and 


satisfy 
[x, p] = ih. (6.17) 


Our task is to find the eigenvalues and eigenstates of H. 
Note that / (ma@/h)x and (1// mah)p are dimensionless. Let us define 


ge +. (cere ee P). 6.18) 


Because x and p are Hermitian it follows that 


1 [ |mo 
rae mes (ON ile 6.19 
° +( h . - i r) ve 


From Eq. (6.17) we obtain 


[a, at] =1. | (6.20) 


Expressing x and p in terms of a and a*, we have 


ER + 
te. (6.21) 
mo 4/2 
a* 
p= Vmoh * (6.22) 


a 


We get, for the Hamiltonian, 
- ho + +) _ + 1 
H = 5 (a*a + aa‘) = ha(a‘a + 4). (6.23) 


Thus, the eigenstates of H are those of a*a. Now we can apply the results of 
Section 6.1, obtaining the eigenstates |0), |1)>, |2>,... that satisfy 
A|n) = (n + Hhaln). (6.24) 


The energy levels are thus E, = (n + 4)ho. 
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The eigenstates themselves are given by Eqs. (6.11) and (6.12). We can 
easily obtain the wave functions 9,(x) = <x|n) as follows: from Eqs. (6.18) 
and (6.11), 


0=al0d) = (W?(x+— p))oo. (6.25) 
2h mo 
Applying <x| and noticing that (x|p|o>) = —ih(d<x|)/dx), we get 
mo h d 
ee - ee ee 0 6.26 
2h (x mo a oy o79) 


(where x is now a number, rather than an operator.) Equation (6.26) is merely 
Eq. (6.11) in coordinate representation, in which it takes the form of a differ- 
ential equation. Solving it, we get 


<x|0» = Ae” (mo! 2h)x? 


where A is a constant. Normalization requires that 


00 00 


<O|x><x|0) dx = |Al? | eW (ma/h)x* qy 


— 0 


1 = <0|0) -| 


— 


= |Al? = 
mo- 


SO 


pee 2 (a a 
7 th 


The phase 6 of A is arbitrary, and we set it equal to zero. Then 
1/4 
‘ 7 (cE) 
th 


1/4 
<x|0> = ("a g~ (mco/2hi)x2 (6.27) 


sO 


We have thus found the wave function for the ground state. For the other 
states we apply a* according to Eq. (1.12): 


pies 


<x|n> = a 


<x|(a*)"|0). (6.28) 
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Since 
dxlat = [2 co (x mes P| 
mo h d 
= Se ee Ss <x|, 
2h mo dx 
we have 


_ 1 (moy"(,_ h ay 
rs (5a) (x mo =) <x! 


1/4 n/2 n 
“e(S)" GY e-Beyrme om 
ni\&% M@ ax 


The matrix elements of observables between harmonic oscillator states can 
be found without having to express the states in coordinate representation and 
integrating over x. We simply express the observable in terms of the raising and 
lowering operators. An example of this procedure is given in the following 
section.* 


6.3 AN ANHARMONIC OSCILLATOR 
Suppose a system has the Hamiltonian 
2 2 
=— + oe x? + Ax*, (6.30) 


Assume that / is small enough (« hq@) that we can use first-order perturbation 
theory, treating Ax* as a perturbation of the Hamiltonian (Eq. (6.16)). Then 
the perturbed energy levels are 


E, = (n + 4)hfow + A,, (6.31) 
where 
A, = <n|Ax*|n). (6.32) 
From Eq. (6.21) we have 
2 
eee e- <n\(a + at)*In). (6.33) 
2m@ 


* Problem: Prove that 


= 3) Se) [le PFY] 
7 


where f(a*) is interpreted as >* a,(a*)", when f(x) = }on a,x". From this formula, find 
a generating function for (x|n). (Hint: Prove first that [a, f(a*)] = f’(a*).] 
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Expanding (a + a*)* gives us 16 terms, but, thanks to the raising and lowering 
properties of a* and a, the only terms giving a nonzero expectation value are 
those with two a’s and two a*’s: 


<n\(a + a*)*|n) 


<n\(ata*aa + ataa*a + a*aaa* 

+ aatata + aataa* + aaa*a™)|n) 
nn — 1) +n? + n(n + 1) 
tnrn+1)+am4+1)?+ (+ I(r t 2) 
6n? + 6n + 3, 


where we have used Eqs. (6.13) and (6.14) repeatedly. Therefore 


2 
Am ia (Qn? + 2n + 1). 
2m@ 


6.4 SYSTEMS OF HARMONIC OSCILLATORS 
Suppose a system has the Hamiltonian 


H=Y.-PP+ ¥ V2.0, (6.34) 


i mM; 
where Q, and P, are canonical coordinates and momenta: 
LO: QO; = [Pi P;\ = 0; [O;, P;\ = iho ;;, (6.35) 


and V;; = V;;. To simplify the presentation a little let us make a change of 
scale, defining | 

ga = Vm; py = Pim, (6.36) 
and 


2 


U;, = Vij. (6.37) 
jn; 


J im 


Then q; and p; are also canonical: 


qi. pjl = iho jj; (6.38) 
and in terms of them the Hamiltonian is 
H=% 2 pp +4 » U; 5:4 j- (6.39) 
i i,j 


We shall express H in terms of raising and lowering operators as we did for the 
one-dimensional oscillator. The procedure involves two steps: the first is 
finding a set of normal coordinates g, with respect to which the potential is in 
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diagonal form, and the second is expressing the coordinates and momenta in 
terms of raising and lowering operators. 
Let the coordinates q; and qg, be related by 


= ¥ Cua (6.40) 


Because (U;;) is assumed to be real and symmetric, the transformation matrix 
C,, that diagonalizes it is orthogonal: 


» CriCgi = ag; > CyiCaj = 0;;- (6.41) 
The inverse transformation of Eq. (6.40) is then 
=) Cuda: (6.42) 


We further assume that the eigenvalues of (U;,) are all positive, that is, that 
the matrix is positive definite (this ensures that qg; = 0 is a point of stable equi- 
librium). Denoting these eigenvalues by w2(w, > 0), we have 


2 Cail pj Ury = 025 aps 
and thus 
y Uiaids = LD Ode. (6.43) 


i,j 


Finally, we define pf, in such a way as to preserve the canonical commutation 
relations: 


Pa = » CuiDis (6.44) 
[Gu Bp] = ihdgg. (6.45) 

The result of our efforts is that 
= 4) (p7 + wiq2), (6.46) 


which means we have a system of decoupled harmonic oscillators (one for each 
value of a). | 7 
Using the methods of Section 6.2, we form lowering and raising operators 


for each mode: 
eee ee ae — Be). (6.47) 
2h = ( _ Oy 


a na (6.48) 
_ = Ja (ve : Jac) 
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Gi, = J - (a, + a,'), (6.49) 
20, 

; [he . 

Bp, = -if a (a, — a, ). (6.50) 
Then, 

[a,, 4g] = [az, ag ] = 9, (6.51) 
[as az | —- Oaps (6.52) 
H = ¥ hofaja, + 4). (6.53) 


The eigenstates of H are described by giving, for each «, the eigenvalue n, of 
a‘a,. Thus, 


H|nyn,n3°°* > = > (n, + 4)hw,|nyn,n3°** >, (7.54) 
Injn.n3°**> = hy | |000 --- >, (6.55) 
a Vn,! 


where the ground state |000- - - ) is defined by 
a,|000:-- > = 0 for all . (6.56) 


Note that the energy of the ground state is >}, 44w,. For a system with 
infinitely many degrees of freedom (which we will consider shortly), this quantity 
will generally be infinite. Because the zero point of energy is a matter of defini- 
tion (only the difference between levels being of physical importance), it is 
convenient to redefine the Hamiltonian of such a system so that the ground- 
state energy is zero. Thus, if we let 


H =4)' (p27 + w2qz — }ho,) (6.57) 


with a corresponding (but more complicated) expression in terms of the original 
coordinates g;, then 


H = ¥' ho,a; a,, (6.58) 
and 
H|n,n.n3°-:> = ¥n,ha,|nyn.n3 °°: >. (6.59) 


6.5 PHONONS 


The states of the system considered in the preceding section can be given a 
simple interpretation in terms of “noninteracting phonons’’. Assume that the 
Hamiltonian is given by Eqs. (6.57) and (6.58), so that the energy of the ground 
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state is zero. The ground state is then called the ‘“‘vacuum state” and represents 
the state of the system in which there are no phonons. If the system is in the 
state |n,n.n3---) we say that there are n, phonons of type a(a = 1, 2,3,...). 
The n, are called “occupation numbers.” Note that the energy of this state is 
nha, + nha, +---, so that the energy of a single phonon of type « is ha,, 
and the total energy is the sum of the energies of the individual phonons. In 
other words, the phonons are noninteracting. 


Since 
az |ny ca Ny—1NgNg+1 ans » = Jn, a I|n, ee) Ng—1> (n, + 1), Ny+4 ae yy 
(6.60) 
a,|ny lis Ny 4NgNg+1 “es » _ Jn,|n, vee y Ny-4; (n, == 1), Ng+4 °°" > (6.61) 


we may call a* and a, creation and annihilation operators for phonons of type a. 
The operator for the number of phonons of type « is a; a,, and the operator 
for the total number of phonons is 


N = ¥ aj a,. (6.62) 


Let the vacuum state be denoted by |0), and let 
la = a |0) = |0,...,0,1,0,...), (6.63) 


be the state with n, = 1 andn, = Ofora’ # a. A phonon, then, is the system 
that is decribed by the states |x» (i.e., the system whose quantum-mechanical 
Hilbert space is spanned by the |a»). If |«> and |B> are (one-) phonon states, 
then 


<a|B> = (Ola,a¢ |0) 
COl(ag a, a Jap)|0> 
= 5, (6.64) 


so that the states |x) are orthonormal. To each normal mode of vibration of the 
original system of harmonic oscillators corresponds a one-phonon state (since 
they are both indexed by «). 

We can also use a similar notation for states of two or more phonons, 
defining 


ls, +05 > = af °°: az|0). (6.65) 


These states are normalized as they stand if «,,..., «, are distinct; otherwise, 
they have a norm larger than 1. Assume for definiteness that a takes on the 
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values 1, 2, 3,.... Then the state with n,, of the a’s equal to 1, n, «’s equal to 2, 
and so on, is 


iy ice gle De 2cee 2ivtc yy SAG, ) (a) "10 
oe 


= JVny!ng!-+* |nynz +++) (6.66) 


so that its norm is Vn! ee 

When we deal with only a few phonons at a time, it is usually more con- 
venient to use the notation |«,,..., %,) rather than the occupation-number 
description |n,n,---, especially when « can take on a continuum of values. 
The effects of creation and destruction operators on |a,,..., %,) are 


Gigs oO, oS Oe Cigars Oa (6.67) 


G)eis ven > = , aan %1> se peg ae cn) (6.68) 


im 


Equation (6.68) comes from Eq. (6.65) and the relation 


n 
+ eoee + — + eee + +: see + 
[a,, Gg, ay, = x Sama, Day Vans 4 ay 


Note that phonons act like Bose particles (insofar as we can call them 
particles*), as an arbitrary number of them may be in any given state (i.e., 
|x,...,@) exists for any number of @’s). Their Bose nature is also reflected in the 
symmetry of the states (e.g., ja, 8B) = |B, a>). In Section 6.7 we shall show how 
the ordinary rules for quantum-mechanically describing systems of many Bose | 
particles lead to a set of states and operators with the same form as those 
obtained here, so that the interpretation of the oscillator as a system of many 
Bose particles is correct. 

We conclude this section by considering the qualitative effect of an anhar- 


monic perturbation on the oscillator system. Suppose the perturbation has 
terms of the form 


2 Vs ji jk and ay V5 ji je Q1- 


1, Jk i, j,k,t 
In terms of creation and annihilation operators, the cubic terms are of the form 
+i +o + + + 
A, 4p A, , Ay Ag Q,,..., Aaga,, 


which always changes the number of phonons (e.g., the first term creates three 


* The phonons will look more like particles (e.g., carrying momentum and energy) 
when the oscillator system is a field as described in the next section. 
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new phonons). Thus, if we start with a definite number of phonons and let the 
Hamiltonian drive the system forward in time, 


W(t)) = e7**|W(0)> 
= |W(0)) — itH|W(0)) + O(¢7), 


we will soon start finding different numbers of particles. The quartic terms 
similarly change the number of particles, except for terms like 


+ + + + 
Ay Ag AyAs, Ay Aga, As,..., 


which conserve the number of particles but act as a mutual interaction between 
them; that is, the particles are no longer independent. The description of a 
mutual interaction will be considered in more detail later. 


Exercise: Verify that the number-of-phonons operator N, defined by Eq. (6.62), 
commutes with a product of creation and destruction operators if and only if 
the number of a*’s equals the number of a’s in the product. 


6.6 FIELD QUANTIZATION 


A notable example of a system with infinitely many degrees of freedom is a 
field. Examples are the amplitude of sound waves, drumhead vibrations, light, 
and so on. Consider a real scalar* field g(x) whose motion? is described by the 
Lagrangiant 


ogres | d3x9(x)o(x) — 4 | dx | d3x'K(x — x/)o(x)o(x’), (6.69) 


where K(x — x’) = K(x’ — x). The classical equations of motion, found by 
varying (x), are 
0 = 0 OL oL 
Ot dg(x) d(x) 


= @(x) + {4 3x'K(x — x’)p(x’). (6.70) 


* The following procedure can be generalized for a multicomponent field by putting 
indices on everything. 

+ Classically g(x) depends on ¢, but (as with q; in Section 7.4) we will not show it 
explicitly. Besides, in the Schrédinger picture, the operator g(x) is time-independent. 
t We assume that the system is invariant under translations, so that K is only a 
function of x — x’. 
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Note how Eqs. (6.69) and (6.70) resemble the corresponding equations for 
the system of harmonic oscillators described in Section 6.4: 


L= 1G os +2; U 5 j9:9j> 
i i,j 
0O= 4; + > U; /4j- 
j 


Thus, we are justified in treating the field as a system of harmonic oscillators 
(at least formally): @ corresponds to the symbol “‘q” and x corresponds to i. 
~o(x) can be thought of as a separate coordinate of the system for each x. 

As an example, suppose that 


K(x — x’) = —@?V’6°(x — x’). (6.71) 
Then Eq. (6.69) becomes, after a few integrations by parts, 


L=4 | d3x[4(x)0(x) — cVo(x)- Vo(x)], (6.72) 


and Eq. (6.70) becomes 
I ox 
V7e(x) — 2 @(x) = 0, (6.73) 


which is the usual wave equation. 
If we assume that @(x) is a coordinate of the system for each x, the conjugate 


momentum to ¢(x) is 
OL 


The Hamiltonian is then 


H 


| d°>xTI(x)@(x) — L 


4 | d°xII(x)M(x) + 4 | d°x | d*x'K(x — x')o(x)o(x'). (6.75) 


To quantize the system we let y(x) and II(x) be Hermitian operators satisfying 

[o(x), o(x')] = [I(x), Mx’)] = 0, (6.76) 

[ p(x), T(x’)] = ihd°(x — x’), (6.77) 

and assume that the Hamiltonian is given by Eq. (6.75), except for a scalar term 
to make the ground-state energy zero. 

We next express everything in terms of “normal modes.” The situation 


turns out to be slightly different from that of Section 6.4 because it is con- 
venient here to use “complex”’ (that is, non-hermitian) normal coordinates. 
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Because the system is translationally invariant, we expect that it might help 
to express the fields in ““momentum”’ representation. Therefore we define 


@(k) = | axoceen (6.78) 
Ii(k) = | d>xII(x)e7i***, (6.79) 
The inverse transformation is* 
Qe jks 
= | —— O(k)e™* 6.80 
(x) | kos (6.80) 
with a similar expression for I(x). Since g(x) and I(x) are Hermitian, we have 
O*(k) = @-K;  T*(k) = T(—&). (6.81) 
From Eqs. (6.76) and (6.77) we obtain 
[G(k), G(k’)] = [(k), N&’)] = 0, (6.82) 
[@(k), (k’)] = ih(2n)°d°(k + k’). (6.83) 
Now let 
w*(k) = | dteKape (6.84) 


From K(x) = K(—x) = K*(x) it follows that 


w?(k) = w*(k)* = w?(—k). (6.85) 
Rewriting the Hamiltonian of Eq. (6.75), we get 
1 ( d°k = ~ ppp ons 
= 5 (2n)3 [Ti(—A)I(k) + w*(k)G(—k)G()] 
=1 (4 trdofiay + 070)5* oH) (6.86) 
2 | (2n)° 


We assume that w?(k) > 0, so that the Hamiltonian is positive definite. [Thus 
w(k) is real, and we take w(k) > 0.] 
In the example described in Eqs. (6.71), (6.72), and (6.73), w(k) = c[k]. 


* Throughout all of this the following integrals will be useful: 
dk 


ik -x — §s3 
(ony? e O0°(x). 


[are ‘* = (27)°d°(k), 
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Next we will define annihilation and creation operators (compare Eqs. (6.47) 
through (6.50)): 


1 See 
k) = ——]| Vo(k) Gk) + nw |. (6.87) 
a(k) il ool) G(R) = 
1 — i _ 
*(k) = —| Vo(k) @(—-A) — = f—-#)]. (6.88) 
a*(k) il 2) H-®) - 
or 
Gk) = Vite [a(k) + a*(—k)], (6.89) 
fick) = par Sa, (k) — a*(—b)]. (6.90) 
The commutation relations are, from Eqs. (6.82) and (6.83), 
[a(k), a(k’)] = [a*(k), a*(k’)] = 0, (6.91) 
[a(k), a*(k’)] = (272)353(k — Kk’). (6.92) 


If we write H in terms of a and a*, making the change of variable k > —k 
when necessary, we obtain 


(2n)° 


plus a correction term to make the vacuum energy zero. The corrected 
Hamiltonian is evidently 


i= : | dK eo(k)at(Wa(k) + a(k)a*(k)] 


d*k 
(27)° 
[Note that the correction term is the infinite quantity —4 | d*khw(k)6°(0)]. 


Finally, we express the original field variables in terms of the creation and 
destruction operators, using Eqs. Ce (6.89), and (6.90): 


ho(k)a* (k)a(k). (6.93) 


Q(x) = lee (On re a(kje*'* + at (ke? ] (6.94) 


M(x) = lo: Te me RAK) F _ igkel* + iat (ke **]. (6.95) 


Equations (6.91) through (6.95) are the important results of the quantization 
procedure. 


The commutation relation of Eq. (6.92) may appear strange, in that 
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[a(k), a*(k)] is infinite (instead of unity), so that the analysis of Section 6.1 
does not apply directly to a(k). Suppose, however, that we use a less singular 
representation. Choose a complete orthonormal set of functions w,(k), where « 
is a discrete index: 


| = (kW ,() = bap (6.96) 
YVAN) = (nyo — &+) (6.97) 
and define 
a, = | a W*(Wa(k). (6.98) 
Then 
[a,, ag |] = dag, (6.99) 


so that we can apply previous results and construct |n,---n,-::). But these 
states may not be eigenstates of H. The states 


Ik) =a*(kK0), —|k, k’) = at (ka*(k’)I0), 


and so forth, though unnormalizable, are eigenstates of H. 

What kind of phonons do that unnormalizable states represent? The state 
|k> is a phonon of energy fiw(k), and we may also say that it has momentum 
hk. To discover the reason for this, consider the operator 


P= avk hka* (k)a(k), (6.100) 
(27)° 
which satisfies 
[P, a*(k)] = hka*(k), [ P, a(k)] = —hkatk) (6.101) 
so that 
P\k,, k2,...) = (hk + hk, +--+ )|ky, ko,...,). (6.102) 
Now, from Eqs. (6.94) and (6.101) we obtain 
[P, o(x)] = ihVe(x). (6.103) 
One can then show that 
et Plit(y)e 2 P/* = (x + a), (6.104) 


so that P generates translations and is therefore the momentum operator. 
In Sections 6.7 and 6.8 we will describe further the relation between the 
operators and the states they create and destroy, as well as how other operators, 
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such as the Hamiltonian, can be written in terms of the creation and destruction 
operators using any basis of states. 


Note: In relativistic quantum mechanics, when one quantizes a free-particle 
field with w(k) = Jk? + m? (h = c = 1), a different normalization and 
summation convention is commonly used for momentum states. Everything is 
written in terms of { d°k/(2z)?2a(k) and (27)*°2a(k)6°(k — k’), which happen 
to be relativistically invariant; to accomplish this change of normalization one 
uses a “relativistic” a(k) equal to A 2w(k) times our a(k). Equations (6.92), 
(6.93), and (6.94) then become 


[a(k), a*(k')] = (22)°2a(k)d°(k — k’); 
d*k Fi 
H = | —.—., ofk)a"(k)alk); 
(27)?2m(k) 
d°k 
(21)*2@(k) 
In some texts the (27)? is also treated differently. We will not use the relativistic 


normalization here, but it is mentioned in case the reader finds it elsewhere and 
wants to reconcile it with our notation. 


(x) = [a(ke™* + at (ke **]. 


6.7 SYSTEMS OF INDISTINGUISHABLE PARTICLES 


In the preceding sections we considered the quantum states of an oscillator 
system as being states of various numbers of a “particle” called a phonon. We 
identified certain states as one-phonon states, and others as states containing 
more than one phonon. 

In this section we shall follow a different line of reasoning. We will start 
with a space of states describing a single particle, either Bose or Fermi, and 
construct the multiparticle states according to standard methods. For the Bose 
case we will arrive at a system of states and operators that is mathematically the 
same as that found previously for a system of oscillators, thereby showing that 
the interpretation of oscillator states as many-phonon states is consistent with 
the usual description of many-particle systems. In the meantime we will have 
also developed a formalism for dealing with Fermi particles, for which the states 
do not resemble those of a harmonic-oscillator system. 

We will treat the Bose and Fermi cases simultaneously, distinguishing them 
by the number (¢; 


+1 if the particles are Bose 
C= ; (6.105) 
—1 if the particles are Fermi. 
We will use the symbol C” (where P is a permutation) to denote | for the Bose 


case and (— 1)? for the Fermi case. 
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Consider first the case of distinguishable particles. If |W,>,..., |w,,) are 
one-particle states, then 


> = Wadl2>> >> lad (6.106) 


describes the n-particle state with the ith particle in state Wo. If |lo> = 
li) 1@2)-- + |@,), then we write 


COlW> = (Ki lC al? + COaD CW Wad * + Wad) 
= (P11Wi><@2lW2)° _- CPnlWa»s (6.107) 


which defines the inner product <g|y). The Hilbert space describing the 
n-particle system is that spanned by all nth-rank tensors with the form of 
Eq. (6.106). 

The state in which the ith particle is localized at the point x; is |x,)|x.>-°- 
|x,». As each x, runs over all space, the resulting states form a complete 
orthonormal set for the n-particle space (ignoring spin and other variables): 


(xg) ++ Cxql 191+ + Lad) = Be, — yi) ty — Yy)s (6.108) 
| dx,- + | d>x,(\x>°- + Lx Qxil + Cel) = 1. 6.109) 


Using this basis we can express the n-particle states in coordinate representation: 


W(X 15. - +5 Xn) = (Cal DIY. (6.110) 
For the particle |W») of Eq. (6.106), we have, using Eq. (6.107), 
W(X). ae Xn) _ W(x Wn : (Xa) (6.111) 


where w(x;) = <x,lWi). 
Next, let us consider indistinguishable particles. We assume that the particles 

obey Bose or Fermi statistics, which means that we must symmetrize or anti- 

symmetrize, respectively, the states obtained in Eq. (6.106). We therefore define 


IWi> x IW2> x ++ x lv.> = (1/v'n!) » Clea Wray eye Wrcnys (6.112) 


where P runs through all permutations of n objects. It will often be convenient 
to write IW, Wo; ee a2 Wn) for vi) x v2) cenaaIE lw, 

The space of n-particle states is that spanned by all “‘products’’ of the form 
of Eq. (6.112). Note that |W,> x--- x |W, is totally symmetric in the Bose 
case and totally antisymmetric in the Fermi case, as it should be. 
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Example: Let |a) and |b) be two single-particle states. If ¢ = +1 (Bose 
particles), 


la) x |b) = |a, b) = 55 (larlb» + |b)|a)), 


la) x ja) = ja, a) = /2Ia la). 


If ¢ = —1 (Fermi particles) 
la, by = ~~ (|a)|b) — [b)Ia)), 
V2 
la,a> = 0. 


Thus, we have the expected result that two Fermi particles cannot be in the 
same state. 

What is the inner product of two of these n-particle states? The answer is 
given by the following theorem* 


- i) a CPilWi> = CPilWn> (6.113) 


aeieoren ent Lonls) <= Cala) |e 


where, for any n xX n matrix A = (A;,), 

| Al, = » CPArpay _ A, P(n): (6.114) 
That is, |A|_ 1s the determinant of A, and |A|, is what is often called the 
permanent of A. 


Proof: 


CP, et Oa Wa cance Wn) CPE (Opal "re (Ppa Woay> oe Won) 


! 
= 
mM 
a 


= 
a 


2 OPC @paylWaay ie (PrmlVom? 


| 


~M! 


» CPC oy Wop-i(1)> — (PulWor-1ny> 


= 


(where we have permuted the factors by P) 
1 -1 
= a ye » cee (Pi \Wop-11)) tote CPulWor-1ny)> 


* Compare this theorem with the well-known formula for vectors in 3-space, 


a:‘ca:d 


b) - d) = 
pa nanan b-cb-d 
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(since CP? = CP" and C2¢P' = cP") 


1 
n! » x C*<@y|Weay> one CPalWriny> 


(letting R = QP‘) 


py Co Wray -? CPualW rn) 
= (Kelle 


which is the desired result. 
Now let {|1), |2>,... } be a complete orthornormal set of states: 


CalB> = Soaps YD laXoel = I. (6.115) 
A complete set of n-particle states consists of |a,, «2,..., %,), Where a, <-°- 
< a, in the Bose case, and a, <--: < a, in the Fermi case. These states are 
orthogonal to one another, but not always normalized. The reader can show, 
using Eq. (6.113), that for a complete orthonormal set of states we may take 


ry 
lays + 25 On) (a, <-°-:<4a,) for bosons 
Jn;! n,! tye 
[a1,.--5 (4, <-*:< a,» for fermions, 
where 1, is the number of times that « occurs in the sequence «,,..., a, (for 


Fermi particles, n, = 0 or 1). 
For either case the completeness relation can be written in the following 
convenient form: 


i 
=, oY, Macs sO, Oss 105m) =: (6.116) 


n! Oy 


Here the range of each a; is unrestricted, duplication of states being taken care 
of by the 1/n! and the normalization. In the Fermi case, the terms with non- 
distinct «, are, of course, zero. The | on the right-hand side of Eq. (6.116) 
means the unit operator on the space of (properly symmetrized) n-particle states. 
Equation (6.116) can be verified by applying the left side to a state |f,..., 6, 
and using Eqs. (6.113) and (6.114). 

The case n = 0 may require some explanation. The zero-particle states are 
tensors of rank zero, that is, scalars (complex numbers). They form a one- 
dimensional space, all of whose elements are proportional to the number 1. 
The ‘“‘state” 1 will be denoted by |vac) (or sometimes by |0)) and called the 
‘“‘vacuum state.” The zero-particle states are thus spanned by the state |vac). 


6.7 Systems of indistinguishable particles 171 


We have constructed, for each n, a Hilbert space that describes a system of n 
particles; thus we have an infinite sequence of spaces. In many processes the 
number of particles is not constant: particles can be created and destroyed. To 
describe such processes we need a Hilbert space that contains states of varying 
numbers of particles. To get such a space we simply combine all the n-particle 
spaces into one big space that we may call the multiparticle space. A general 
state in the multiparticle space is of the form 


> = WO) + WP) + WO) + WO) +o, LIT) 


where |W“ is an n-particle state. | 

We define states of different numbers of particles to be orthogonal to each 
other, so that if |g) is another multiparticle state and is expressed in the manner 
of Eq. (6.117), then 


Kelp) = (pO) + C~P WO) 4-->. (6.118) 


If {|x>} is a complete orthonormal set of states, so that Eq. (6.115) holds, 
then using Eqs. (6.113), (6.115), and (6.118) we may summarize orthogonality 
by 


6 6 


Er 2) | PPERRR (10 ear ra fae (6.119) 
Oanb OenBn 4 
From Eq. (6.116) we also have the completeness relation 
= i 
x te Ds Gi saaicacOhe YOO po ourens Oe.) rye (6.120) 


atl as 


In this equation means the unit operator on the whole multiparticle space. 

As an example, suppose we describe the states in coordinate representation. 
The (unnormalizable) state |x,,..., x,) describes the situation in which there 
is one particle each at points x,,..., x,. Then Eqs. (6.119) and (6.120) become 


b°(x — y,) eae b7(x £3 ) 
(Eps 0025 MalVas- + +s Ym) = Onn : : : In 5 6.121 
S(x,-y) <<: d(x, — y,) ( ) 
~ 
nl d>x, | dP ety os) tie = | (6.122) 


We may expand an arbitrary multiparticle state |» as follows, using Eq. (6.122): 


a | 
lW> = x = [an | atade ey Xp yWM(X,,...5 X,). (6.123) 
Here 


Wey, 4 Mn) = CX ps- ++, HalW) (6.124) 
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is (if |W) is normalized) the amplitude for the state |y)> to have n particles, one 
at each x;. Note that w(x,,..., x,) is symmetric or antisymmetric according 
to the statistics. Note also that if |W) is an n-particle state and is of the form 
lW1,-.-, W,», where each |;» is a one-particle state, then 


W™(x,,...,X%,) =O unless m=n; 


W(x) 77 Win) (6.125) 
W(X) 3 Wil Xn) . 


where w,(x,;) = ¢x,;|W,>. Equation (6.125) follows from Eqs. (6.124) and 
(6.113). In the Fermi case the determinant is called the “‘Slater determinant.” 

We are now ready to define creation and destruction operators. These 
operators are fundamental for two reasons; first, we constructed the multi- 
particle states so that we could describe changing numbers of particles, and we 
need some operators that can effect this change, and second, other operators, 
such as the total energy, will turn out to be simply expressible in terms of the 
creation and destruction operators. 

Let |p) be any one-particle state. We define a*(¢) to be that linear operator 
which satisfies 


W(x, er) x,,) = 


a* (9) |, secesg Ww,» ae lp, Wi, eds) W,)> | (6.126) 


for any n-particle state |W,,...,W,)>. For = 0 this is understood to mean 
a*(g)|vac) = |p). We call a*(@) the creation operator for the state |p), and 
its adjoint a(@) the destruction operator. 

A creation operator clearly converts an n-particle state into an ( + 1)- 
particle state. It is easily seen that a destruction operator turns an n-particle 
state into an (” — 1)-particle state and annihilates the vacuum state. To find 
the effect of a(p) on an n-particle state |W,---w,,) we multiply on the left by an 
arbitrary (n — 1)-particle state (y,--- 7%,-4|- 


(X1 °° Xn-114(P) |W °° Wad 
= (Wr Wala (@)lxa + Xn-1>* 
= Wr Wil, X10 Xn-1>* 
_ oe Wil tn—1> |* 


WilO>VlX> ao WalXn—1> C 
n Wilx> ~— (WilXn-1> * 
- 12 CAD) (no ,) ! 
ne CWalX1> “* WrlXn-1> 4 
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(an expansion by minors) 


SS oN ualoy* ha (a0 Wa) Wale ans * 


by Colt 41 “+, 11Wi °° (no yy) °°: Wi,» 


Because this is for arbitrary (y,° °° %,—1| we have finally, 


a(g)IVr = Wad = YO lVadlvs *** (90 Wy)“ a) (6.127) 


Thus the destruction operator removes the states |y;), one at a time, leaving a 
sum of (n — 1)-particle states. In the Bose case (¢ = 1) the terms all have 
a + sign, whereas in the Fermi case (C = —1) they alternate in sign. 

Equations (6.126) and (6.127) describe the action of creation and destruction 
operators on many-particle states. From Eq. (6.126) it follows that 


a*(p,)a* (2) = Ca*(p2)a* (94), 


or 
[a* (91), a*(2)]-¢ = 9, (6.128) 
where [A, B]_, = AB — CBA; that is, [AB]_(C = +1) is the commutator 
and [A, B],(¢ = —1) is the anticommutator. 
Taking the adjoint of Eq. (6.128) we obtain the further result 
[a(p1), a(p2)]-, = 0. (6.129) 


Thus, the creation operators commute for Bose particles and anticommute for 
Fermi particles, and similarly for destruction operators. 


Now, what is [a(~,), a"(@2)]-~? Does it (or any similar expression) 
reduce to anything simple? We first calculate 


a(p,)a*(Pr2)lWy °° Vi» 
AP (2, Wi Wa 


(Pla) Wy SOUL ef > Cpl 12, Wi°°*(noy,)°**W,)>, (6.130) 


and then 


a*(p2)a(o IW, “+ Wy 


I 
M:= 


Co lWa* (aly “++ (no Wy) °° WD 


k 


y Co We, lP2, Wi (no °°: Wy) 
(6.131) 


1 
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Multiplying Eq. (6.131) by ¢ and subtracting it from Eq. (6.130) we see that 


[a(1), 2° (@2)]-¢ = (91192). (6.132) 


Equations (6.128), (6.129), and (6.132) are the fundamental ‘“‘ccommutation”’ 
relations for creation and destruction operators. 

The relations we have derived are usually stated in terms of an orthonormal 
basis, and we shall now do that. Let {|x>} = {|1), |2),...} be a complete 
orthonormal set of one-particle states. It is usual to let a, = a(a). Then, since 
(a|B> = dyg, we have [a,, ag ]-, = 6,5. We consider the Bose and Fermi 
cases separately. 


Bose Case 
Let 
[1a ate cust eden os aia) 


Vnytng!-* 


where n, is the number of times « appears in the ket on the right. Then the 
|ln;n,°- +) (eachn, = 0, 1, 2, 3,...) form an orthonormal basis for the whole 
multiparticle space. From Eqs. (6.133), (6.126), and (6.127) we find 


Inna D = (6.133) 


ay |nyny ssn) = Vn, +1 Jnyny--+n, + 1---), (6.134) 
a,|NyN2° Nyt) = Jn, jtyma*** Noy ** Ds (6.135) 

The commutation relations are 
[a,, 43] = [af,ag]=9; — [a,, ag ] = dyy. (6.136) 


Equations (6.134), (6.135), and (6.136) are identical to Eqs. (6.60), (6.61), 
(6.51), and (6.52) for raising and lowering operators for a system of harmonic 
oscillators. The operator for the number of particles in the state |x» is 


eer + 
N, = ay aq: 


The notation of Eq. (6.133) (in terms of “occupation numbers’’) 1s generally 
not the most convenient. It is more natural, in fact, to continue using the nota- 
tion we have been using all along in this section, that is, the notation |a,, «2,..., 
a,» This notation was discussed in Section 6.5 in connection with phonons. 
Note that Eqs. (6.126) and (6.127) of this section, when applied to states of the 
form |a,,..., %,), become identical to Eqs. (6.67) and (6.68). Thus, creation 
and destruction operators for a system of Bose particles look just like those for 
what we called a “‘phonon”’ system. 
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Fermi Case 
Using the notation |a,,..., %,», we have 

7a Rac 2) ea oe area: (6.137) 
and 


n 
Aglotys-- 25 Gey = Y CHU Spal 6s ends Metis ss 2 np» (6.138) 
k=1 
We could also use the occupation-number notation 


nyny-*- ) = |, &2°** D 


where a, < a, <--:, and 7, is the number of times « occurs (”, = 0 or 1) in 
this sequence. If n, = 0, then a; changes it to 1, whereas a, annihilates the 
state. Ifn, = 1, then a, changes it to 0, whereas a; annihilates the state. There 
are also factors of +1 involved, depending on what other states are occupied. 
It is easiest merely to remember Eqs. (6.137) and (6.138). 

Note that a(g)? = a*(g)* = 0 for any one-particle state |p). This state- 
ment follows from Eqs. (6.128) and (6.129) (with €¢ = —1 and 9, = 9, = @), 
and it is also equivalent to the fact that two fermions cannot be in the same state, 
that is, |p, p> = 0. 

One could also derive Eqs. (6.137) and (6.138) directly from the anti- 
commutation relations 


[a,, aps = [a;. ap |+ aa 0; [a., ag | = Oup (6.139) 


as was done in previous sections for the a,’s of the harmonic-oscillator system. 
But for the Fermi case there does not appear to be any a priori reason for pos- 
tulating Eq. (6.139). (Remember, for oscillators the corresponding commutation 
rules followed from the canonical quantization procedure.) One may rather 
consider Eq. (6.139) as derived from the antisymmetrization postulate for 
fermions. 

Let us return to the general case where Eqs. (6.126) through (6.132) apply. 
One advantage of deriving them in such a general form is that we are not tied 
down to a particular basis of states. Suppose we use a basis of momentum 
eigenstates, |p). Because ¢p|p’) = (27)°5°(p — p’) we have 


[a(p), a*(p’)|-¢ = (22)°6°(p — p’), 
La(p), a(p’)|-; = [a*(p), a" (p')]-; = 0. (6.140) 
From the vacuum state we can construct the other states by 


Pi, SS 8 Pu> = a*(p;) es a*(p,)|vac). (6.141) 
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If we use a basis of position eigenstates |x), then since (x|x’) = 6°(x — x’), 
[a(x), a*(x')]-, = (x — x’), (6.142) 
|X,,---,%,) = a*(x,)°-- a*(x,)|vac). (6.143) 


If we use a basis of hydrogen-atom energy eigenstates |n/m}, then [a(nim), 
a*(n'l'm')|—¢ = San’ Sy Omm’s aNd SO on. 

How do the creation and destruction operators change when we make a 
change of basis? This question is easily answered by noting that if 


I = al) + Ble), (6.144) 
then 
a*(x) = aa*(w) + Ba* (9), 
a(x) = a*a(W) + B*a(Q), (6.145) 


This means that creation operators “‘transform’’ like kets, whereas destruction 
operators “transform” like bras (because <y| = «*<w| + B*<q@|). Equation 
(6.145) are readily generalized to infinite series and integrals. Now if we change, 
for example, from position to momentum representation, so that 


p> = | d3x|x)<x|p) = | d?x|xye?"*, 


re d3p . (6.146) 
x) = x= er rs, 
a) = |G boladey = | oS le 
the creation operators are related by 
a*(p) a | dearer 
(6.147) 
d>p ie, 
+ — a* eR x. 
a* (x) (on)3 6) 


To relate the destruction operators a(p) and a(x), simply take the Hermitian 
adjoint of Eq. (6.147). One proceeds in a similar way for any other change of 
basis. 


Exercise: Suppose we have a complete orthonormal set of states |a>, and we 
let the “‘wave functions” of these states be (x|x)> = u,(x). Write down the 
formulas for a*(x) and a(x) in terms of at and a,, and vice versa. 


6.8 THE HAMILTONIAN AND OTHER OPERATORS 


In the last section we developed a method for describing systems containing 
many Bose or Fermi particles, and we defined creation and destruction operators 
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for the particle states. We will now show that these operators have other uses 
than merely creating and destroying states. 

Suppose A“ is an operator that acts only on one-particle states. We 
wish to find an operator A that represents the ‘“‘sum of A“? over all of the 
particles.” That is, for any n-particle state. 


ly» a ly, . ee Wn» = Wi x" Xx Wn (6.148) 
we want Aly» to satisfy 


Al) = AM|Wy> x |W2> x -+° x Wn» + lWi> x Aly) xX +> xX [Wad 
+++ [Wy> x [2d x07 K AMY). (6.149) 


To see what this means, suppose each |,) is an eigenstate of A“ with eigen- 
value a;. Then Eq. (6.149) implies that 


Alp) = (a, +°°+ + @,)|\W). (6.150) 


For example, if A“ is the single-particle Hamiltonian, then A is the total energy 
(ignoring mutual interactions, which we shall consider later in this section). 
If A® is the momentum operator for a single particle, then A is the total mo- 
mentum. If A?) = 1 (the unit operator on one-particle states), then A = N, 
the “number-of-particles operator.” 

The desired operator A is easy to find. We first find it for the special case 
A“) = |a)(B|. In this case Eq. (6.149) becomes 


Alp» = (Bly) lo, Wo, pe ee Wn» a CBlva> IW, Ayes Wn» 
$27 + Bids Vases a. (6.151) 


Now look back at Eq. (6.131) and notice that when g, = « and yg, = B we 
have 


a*(aa(B)W) = YEN Bla dar +5 (M0 Vids +5 Yn)» 6.152) 
But 


CK Ta, Wi,---, (No Wy)... 5 Wad = [Wires Wea % Wetae sees Wn) 


due to the symmetry property of the n-particle state. Using this equation in 
Eq. (6.152) and comparing it with Eq. (6.151) we find 


A =at(a)a(B) when A = |xd<Bl. (6.153) 


The generalization of Eq. (6.153) for an arbitrary one-particle operator A“ is 
immediate. We choose a basis—any basis—of one-particle state |«), and write 


AS) = 2, leat 1B><B| 7 D, Ase l)<Bl. (6.154) 
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Then, from Eq. (6.153) and linearity, 
A= >: AYa* (a)a(B). (6.155) 
a8 


As a first example we consider A“) = 1, so that A = N, the operator for 
the number of particles. Using various bases we have 


1%) = dX Jaa 


= | d3x|x)<x| 


_ [ d’p 
= | 1p>< pl (6.156) 


from which we can immediately write 


N = Yiajsa, 
= {¢ 3xa*(x)a(x) 


_ d*p 4 
-_ (on)3 a“(p)a(p). (6.157) 


Next, consider the momentum operator. Because 


Po = | oa PIP><Pl 


= | d?x|x) + Vix] (6.158) 
1 


we have for the total momentum 


_ (4? agt 
p= | opi Pa (PA) 
_ | Je Gee). (6.159) 
l 


(Compare the first of these expressions for P with Eq. (6.100), another expression 
for P.) 
Finally, suppose the Hamiltonian for a single particle is 


2 
HY = i + V(x), (6.160) 
2m 
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where x is the position operator. In coordinate representation, 
(x|H |x") = — = V753(x — x’) + Vix)d%(x — x), (6.161) 
m 
so that 


H= |a°s [aes pee V203(x — x’) + V(x)07(x — | a*(x)a(x’) 


= | axa" | - v" + ve) | a(x). (6.162) 
2m 
In momentum representation* 


2 | _ 
(pIH |p") = a (22)°d°(p — p’) + | d>xe7'?'*V(x)elP* 
m 


=F (22)°6°(p — p') + Vip — p’), (6.163) 
2m 
where 
V(q) = | d>xV(x)e7i8"*, (6.164) 
Therefore 
op a d3p d3p' 7 oe . 
i: p’ ra d3p ‘: 
(2n)° 2m (p)a(p) + (Qn)? = 4, Vig)a*(p + )a(p). 
(6.165) 


The term V(q)a*t(p + q)a(p) annihilates a particle of momentum p and recreates 
it with momentum p + q, the amplitude for this process being V(q). If we use a 
basis of eigenstate |«) of H“, so that 


Kal|H™ |B) = E,byp (6.166) 
then 
H = ¥ E,a; a,, (6.167) 
which is what we had for phonons with E, = ha,. 
* The second term is obtained by noting that 


V(X,) = | d>xV(x)|x) (x 


and putting this between states <p| and |p’). 
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All of these expressions for H can be derived from each other using the 
formulas relating a* (x) and a*(p), and so on, described at the end of Section 6.7. 
But it is often simplest to obtain such expressions directly from the single- 
particle Hamiltonians, as we have done here. 

The density of particles (that is, number per unit volume) at the point x 
is given by the operator 


p(x) = a*(x)a(x) (6.168) 


(which corresponds to the one-particle operator |x)<x|). Thus the number-of- 
particles operator in Eq. (6.157) may be written as 


N= | ¢?xp0, 


and the potential-energy term in Eq. (6.162) as 


V = {¢ 3xV(x)p(x). 


This last equation represents the integral of the potential energy weighted by the 
density. 

So far we have described a system of independent particles, each particle 
being (possibly) acted on by an external potential but no two particles influencing 
each other. Suppose, however, that there is an additional potential V(x,, x,) 
between any two particles at x; and x, (giving rise to a “two-body” force). We 
assume that V(x,, x,) = V(x,, x;). On two-particle states the operator 
is then 


yO = 4 | dx | d?ylx, y V(x, yx, yl (6.169) 


as can be verified by applying it to a two-particle state |x,, x.>. We now want 
an operator V on the whole multiparticle space such that 


Viking ty) = 2, V(x; X11, +--+. Xap 
L<J 
= 4 », VAN, xX, ary X,): (6.170) 
i#j 


Looking at Eq. (6.169) and noticing that a*(x)a*(y) creates the state |x, y), 
whereas a(y)a(x) destroys the same state, we might guess that 


V=14 | d>x | d>ya*(x)at(y)V (x, y)a(y)a(x). (6.171) 
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This is in fact correct, as can be verified by applying V to |x,,..., x,>. Using 
Eq. (6.127) twice, we have 


a(y)a(x)|x1°** x,) 


= aly) ¥ C49 — xy))xy +++ (0-2) --* ¥,) 
k=1 
Fa y CK T6°(x — x,) > NiO CY — x,)[X,°°° (no x,, X;)°*" Xn)s 
k=1 j=l 


where 
ci-} if j<k 
Nik = 4,; ae 
hy if j>k 
Then 
a*(x)a*(y)a(y)a(x)|X1,.--5 Xn) 
= » Cn ,0°(x _ x,)0°(y _ X;)|x, y> X15 a) (no Xk» X;)s a | X,) 
jFk 
= d Cn (x — x)O(Y — XIX Xj, ---, (NO X,, Xj),---, Xn) 
jFk 


Y d(x — x,)O°(y — x,)[x1,.--, Xn)- 


J#k 


Multiplying by 4V(x, y) and integrating over x and y, we find that V as 
given by Eq. (6.171) indeed satisfies Eq. (6.170). 

In view of Eq. (6.168) and the remarks following it, we might expect that 
the mutual interaction could also be described in terms of the particle density 
by the operator 


vi =4 | dx | d>yV (x, y)p(x)p(,). (6.172) 


However, V’ is not quite the same as V. To see the difference we write 
a” (x)a(x)a™(y)a(y) 

Ca*(x)a*(y)a(x)a(y) + 6°(x — y)a*(x)a(y) 

a” (x)a*(y)a(y)a(x) + 6°(x — y)a*(x)a(x), 


p(x)p(y) 


I 


so that 


Vi=V4+4 [eve x)p(x). (6.173) 


Thus V’ contains an extra term, which may be interpreted as a self-energy; 
it contributes even when there is only one particle present. The true mutual 
interaction V is zero unless there are two or more particles. We want only the 
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Fig. 6.1 Operator V adds momentum g to one particle and subtracts it from the other. 


mutual interaction, because any self-energy (if it exists in nature) can be included 
in the Hamiltonian of Eq. (6.162). Besides, for many potentials (for example, 
the Coulomb potential), V’ is infinite and is not what we would consider to be 
the true energy. 

If we express V in momentum representation, using Eqs. (6.171) and 
(6.147), and assume 


V(x, y) = Vix — y) = Viy — x), (6.174) 
we arrive at 
_ 1 dq d>p d3p' 
2 | (2x)° | (2x)° J (2n)° 
Here V(qg) is as defined in Eq. (6.164) (in proving Eq. (6.175) we use V(q) = 
V(—q), because V(x) = V(—x)). Note that 


V(qg)a*(p + q)a*(p' — qg)a(p')a(p). — (6.175) 


d*q = 
ViP1; P2> = V@\pi: + 4,P2 —-@:; 


(27)° 
which says that V adds momentum q to one particle and subtracts it from the 
other with amplitude V(q). This process is denoted by the diagram in Fig. 6.1. 
The total momentum is conserved, as we would expect, because Eq. (6.174) 
implies that the mutual interaction is invariant under translations. 
The Hamiltonian and mutual interactions described here all conserve the 
total number of particles (see the exercise at the end of Section 6.5). 


6.9 GROUND STATE FOR A FERMION SYSTEM 


In this section we will consider fermions, so that the results of Sections 6.7 and 
6.8 apply with € = —1. Suppose we know the eigenstates and eigenvalues of 
the single-particle Hamiltonians, H“): 


Hl) = Ela), (a = 1, 2,3,...). (6.176) 
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Then, ignoring mutual interactions, we have for the multiparticle Hamiltonian: 


H =) E,a; 4,. (6.177) 


Every state may be built up from the vacuum state by applying creation 
operators: 
lo1,.++5%) = a;,..., az |vac). (6.178) 


(Both sides vanish unless the «; are all distinct.) 

It is often inconvenient to refer everything to the vacuum state, as in 
Eq. (6.178). In practice we may be considering states that differ from some 
“sround” state only by the presence or absence of a few particles. Suppose 
there are G particles present. Assume that the energy levels are ordered such 
that 


EB, <FE,< EB, <ees, 


Then the state of lowest energy is 
lgnd) = |1,..., G) = aj -:- agl|vac), (6.179) 
which we call the ground state; its energy is 
Ea Ey or oe Ee (6.180) 


Any other G-particle state will have some of the levels 1,..., G unoccupied 
and some higher levels occupied. It is convenient to use |gnd) as a reference, 
describing the removal of a particle from 1,..., G as the creation of a “hole.” 
Particles in the levels G+ 1,G + 2,... are still called “particles.” If a 
particle is excited from the level « to the level B(a < G < f), then we say that a 
hole with energy — E, has been created as well as a particle with energy Ep. 

The concept of holes may be formulated mathematically as follows: 
Define 


b, = ag for a <G. (6.181) 
From the anticommutation relations in Eq. (6.139) we have, if a, a’ > G and 
B, B' < G, | 
[a,, a, \+ =a La,, bg \+ = Lbs, byl = 0, 
[aa, Gy ]4 = Saar Lbs, bg: |, = Opp’, (6.182) 
[ a. bs \4 => 0. 
Thus the operators a7 (a > G) and b;(« < G) behave like creation operators. 


We can now express the states in the form 


la3,.--5 ms Bi,---, B,, gnd> = af---azbs.-+: bs |gnd) (6.183) 
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(«; > G, B; < G). We call this a state with m particles and n holes. The 
ground state acts like a vacuum state in that 


a,|gnd) = bg|gnd> = 0. (6.184) 
The Hamiltonian of Eq. (6.177) may be written in the form 
H= ¥ E,ata, + Y E,b,bt 


a>G a<G 


yy E,aja, + ¥ E,(—b7b, + 1) 


a>G a<G 


= Bona + > E,az 4, — » E,b; b,, (6.185) 
a>G 


a<G 


so that the energy of a hole in state « is — E,. In other words, it takes an energy 
— E, to create the hole. The number-of-particles operator is 


N = Yiajsa, 
= )i aga,t+ ¥ (-b7b, + 1) 
a>G a<G 
=G+ ¥ afa,-— Y bfb,; (6.186) 
a>G a<G 


thus a hole counts as —1. The number of particles and holes outside the ground 
state (that is, the number m + n in Eq. (6.183)) is given by the operator 
N’'= ¥ aja, + ¥ bfb,, (6.187) 
a>G a<G 
which counts a hole as +1 and the ground state as nothing. 
Suppose now that a perturbation is applied to the system in the form of an 


external potential 
U = >) Usa, Gi; (6.188) 
a,B 


where we assume that the single-particle potential U{;) may be nondiagonal. 
Then 


U = YUS ala, + YY UsSPatbs 


a>G a>G 

B>G B<G 

+ DY UPbady — Y Up bide + LY Use” (6.189) 
a<G a<G a<G 
B>G B<G 


The first and fourth terms of this equation modify the energies of the particles 
and holes, respectively, and the fifth term modifies the ground-state energy. 
The second term creates particle-hole pairs, and the third term destroys them. 
Note that N, as given by Eq. (6.186), is conserved, whereas N’, as defined in 
Eq. (6.187), is not. 
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Suppose further that there is a mutual interaction, a two-body potential 
of the form described in Section 6.8: 
V = y VO Ag Ag 450, (6.190) 
As in Eq. (6.189), we can express V in terms of a, (a > G) and b, (« < G), and 
then use the anticommutation rules of Eq. (6.182) to write V in a form such 
that in every term all of the creation operators are to the left of all of the de- 
struction operators. Such terms are called “normal products.” The result 
will be a number of terms involving four operators, plus other terms that can be 
lumped into U or added to Ejna. 


6.10 HAMILTONIAN FOR A PHONON-ELECTRON SYSTEM 


Now we will consider, as an application of the creation and destruction operator 
formalism, the interaction between the electrons and lattice vibrations in a 
crystal. It is this interaction (as well as the presence of crystal impurities and 
imperfections) that accounts for the finite conductivity of metals under most 
conditions. (One might otherwise expect from the band theory of metals that, 
once an electron got into an unfilled band, it would move unhindered, resulting 
in infinite conductivity.) We will show how the Hamiltonian is derived and 
written in terms of creation and destruction operators. 

We start with the electron in a lattice with no vibrations. Let N = n,a,; + 
N,a, + n3a3 (where n,, nz, nz are integers) describe the positions of the nuclei. 
The potential V(x) felt by an electron in the lattice is periodic and has the form 


V(x) = ¥ V(x — N). (6.191) 


The Hamiltonian for a system of independent electrons in the lattice is, from 
Section 6.8, 


Ha = | @xa*@ i a V* + rs) a(x). (6.192) 


Now, suppose we have solved this part of the problem and know the one- 
particle eigenstates |x) and eigenvalues E,. Denoting (x|«) by @,(x) we have 


h? 
|- Es re| 9.8) = Espa). (6.193) 
Then we can express the electron Hamiltonian as 

Hy = ¥, E,a; a,. (6.194) 


Because 


Ix) = ¥ ladXaley = ¥ ledoze) 
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we have 
a*(x) = ¥ af p5(x) (6.195) 


and similarly 
ay = | d? xa* (x)—,(x). (6.196) 


To obtain relations between the destruction operators, simply take the Hermitian 
adjoints of Eqs. (6.195) and (6.196). 
If, as in Section 6.9, we refer all states to a ground state (called the ‘‘electron 
sea’’), which is normally filled, then in the notation of that section, 
Ha = Egna + x E,aza, — ) Eb; b,. (6.197) 
a<G 
In what follows we shall ignore the ground-state energy, so that the electron 
Hamiltonian becomes H;;: 
.— 2 E,aza, — ¥ E,by b,. (6.198) 
a<G 
Note that, in terms of electron and hole operators, Eq. (6.195) and its adjoint 
become 


a(x) = Y azota) + Y bok) 


a> as (6.199) 
a(x) = X A,P (x) + p> by P(x). 


Consider next the lattice vibrations. For each N let Z, be the displacement of 
the corresponding nucleus from its equilibrium position N. The Z, form a set 
of coordinates for a system of harmonic oscillators. The procedure for finding 
the normal modes and quantizing this system is similar to the field quantization 
of Section 6.6, except that Fourier transforms are replaced by Fourier series, 
with the ““momentum”’ & of the phonons running over a limited region. The nor- 
mal coordinates gq, are related to the Zy by equations of the following form 
(assuming for simplicity that ‘‘a’” runs from 1 to 3 as in the case of one atom per 
unit cell of the crystal): 


Je a> ek eh as ZN; 
Z (6.200) 


i 


Here the region of integration is K = {k| —-xa < k-a; < 1; i = 1, 2, 3}; 
V = |a, X a@,°a;| is the volume of the unit cell of the crystal lattice (recall that 
the a; are vectors describing the periodicity of the lattice), and e, , (a = 1, 2, 3) 
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form an orthonormal basis of 3-space for each k (in that ef ,:e,, = 54), 
chosen so that the mode (Kk, a) is a normal mode. 

You should check the consistency of Eqs. (6.200) as an exercise. It is 
convenient to use the relation 


d°k giK (N-N’) _ 
K (2z)° 


and the easiest way to verify it is to make a change of variables r; = K : a;. 
N — N' can be written as 


ON.N’s 


3 
>; m,a;, 
i=1 


where m, are integers, so that K-(N — N’) = m-r, and the Jacobian of the 
transformation is 


sd | ne 
or or) Ve 
| Ok 


Equations 6.200 are essentially a special case of Eqs. (1.28) and (1.29). The 
change of notation is 


Chapter 1 Chapter 6 
0,(k) the 
JV 
ai(k) The ath component of e, , 
Zu,N The ath component of Z, 
V 
— V 
n 


The normalization conventions used in Eq. (6.200) are convenient because, 
in the limit that the lattice approaches a continuum (i.e., a; — 0), K becomes all 
of momentum space and, letting x = N, we have 


VO | dx and 7, Onn’ + 5(x — x’), (6.201) 
N 


Thus we have obtained the normalization of Section 6.6 for a field g(x), where 


VM/V Zy > (x). 
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Following the procedure of Section 6.6 we have an expression for Zy in 
terms of creation and destruction operators: 


Zy = a > [am [A(k, a)e™ Ne, , + At(k, aje~™* Nex |] 
x (22)? “@ V 2Moa(k, a) “ ° rene 


(6.202) 
where 


[A(k, a), A*(k’, a’)] = (22)353(k — k’)5,0°- (6.203) 


M is the mass of the vibrating atom, and w(k, a) is the frequency of the cor- 
responding mode. The Hamiltonian for the oscillator system (apart from a 
constant term) is 


3 

Howe = | oe > halk, a)A*(k, a)A(k, a). (6.204) 
x (2n)° “a 

(We have used A for the phonon operators* to avoid confusion with the electron 

creation and destruction operators.) 

Having written down Hamiltonians for the electrons and the phonons 
(lattice-vibration: states), we now turn to the interaction between them. The 
potential energy of an electron in an undisturbed lattice was given by Eq. (6.191). 
If the nucleus at N is displaced by an amount Zy, then the potential energy 
changes to 


V(x) + AV,(x) = VVo(x — N — Zy), (6.205) 


where we have assumed that each atom in the crystal acts like a rigid body 
when it is displaced, so that the potential V, arising from it is simply displaced 
by Zy. (In practice, not all the electron shells around the nucleus move by the 
same amount, so that the potential changes its shape as well as being displaced; 
however, we shall ignore this fact.) The potential V,(x) was already included 
in H,,. Writing 


we have for the interaction energy (that is, the extra energy of the system due to 
displacement of the lattice): 


AV,(x) = — ¥ Zy + VWWo(x — N). (6.206) 
N 


* Note that A*(k, a) (as well as the state it creates) has the dimensions of (length)*/, as 
do the corresponding operators for other particles in momentum representation. 
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For the many electron system the interaction Hamiltonian 1s 
Him = | d>x AV ,(x)a* (x)a(x) 
= > ViupIa Ap, (6.207) 
a,B 


where 


ss 
I 


{a xo*(x) AV,(x)p,(x) 


- Y2y: \@ xo*(x)VWVo(x — N)Q4(x). (6.208) 


Now V,, is also an operator on phonon states, because Zy is an operator. 
Using Eq. (6.202) we . 


Vig = a an ay [Cp(k, a)A*(k, a) + C3,(k, a)A(k, a)], (6.209) 
where 
AV ik-N 
Call 0) = = EO eau BetN | PaoZle¥ Vole — Nola 
(6.210) 
Therefore, 


Hin = o On On? Y z [Cig(k, a)A*(k, a) + Cz,(k, a)A(k, a) Jaz ag. (6.211) 


The total Hamiltonian of the system is 
A = Hy + Ao. + Aint: (6.212) 
Note: The states of our system are of the form 
ly. 205 Og Ky, @1,---5 Kino Gm) (n electrons and m phonons), 
or, if we use the hole notation, 
[Wiha ih Ones Dixccag Pas Kis Ciiwess hel, 
(n electrons, m holes, and p phonons). 
The effect of a creation operator A*(k, a) on such a state may be defined as 
A*(k, a)|electrons, holes, k,, a;,...) = |k, a, electrons, holes, a,,... ), 
which is in turn defined to be 


lelectrons, holes, k, a, k,, a,,.... 
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The result of this definition is that all phonon operators commute with all 
electron (and hole) operators. (In general, it is conventional to say that creation 
and destruction operators for different particles always commute unless the 
particles are both fermions. In the latter case it is convenient to define the 
operators to anticommute; then everything is consistent if we decide to call the 
particles different states of the same particle, as with the proton and neutron.) 

If we write Eq. (6.211) in terms of electron and hole operators, we get an 
expression four times as long, involving terms of the form 


A*a‘a, Atath*, A‘ ba, At bb*, 
A a’a, A a‘b*, A ba, A bb*. 


These terms represent the transition of an electron or hole from one state to 
another, or the creation or annihilation of an electron-hole pair. In each 
process a phonon is emitted or absorbed. 

The foregoing derivation assumed that there is only one atom per unit 
cell of the crystal. However, the results are similar if there are more atoms 
(say A) per unit cell, the only difference in the final result being that there are 
more phonon modes for a given k (a running from 1 to 3A). As a simplification 
it may turn out that some of the modes do not “‘couple”’ to the electrons, that is, 
they do not influence the potential felt by the electrons; these modes are 
independent of the others and may be ignored. Such is the case in the Polaron 
Problem (Chapter 8), where only one phonon mode contributes for each k. 


6.11 PHOTON-ELECTRON INTERACTIONS 


Suppose we shine light on the crystal of Section 6.10. What is the Hamiltonian 

now? To H we must add a term H, for the free electromagnetic field (y refers 

to phonons) and a term H,, for the interaction between electrons and phonons. 
For the free electromagnetic field, the classical Lagrangian density is 


B = \(E* — 2B?) 
= 4[A? — c?(V x A)’] 


3 
3 [4 oe e >; (V;A,V;A; aes v4, 40 5 (6.213) 


i,j=1 
where A(x) satisfies the subsidiary condition”) 

V:-A = 0. (6.214) 
* We assume also that the scalar potential is zero. We are using “‘rationalized units,”’ 


but with ¢, = 1; thus e?/4zhc ~ 1/137 and V-E = p, V x B = (1/c*)(j + OE/dt), 
etc. 
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The equation of motion resulting from the Lagrangian is 


- 1 6d? 
0 = c[V7A — V((V-A)] —A=c?|V? -—- 
lve — WA A= ot] 


A. (6.215) 


Carrying out the quantization procedure of Section 6.6, we find m(k) = 
c|k| = ck, and 


d°k : h ik+x + —ik-+x 
A(x) = > is [C(k, r je + C (k, r Je Jex.,- (6.216) 
64 


Here e, , (r = 1, 2) are two unit vectors, perpendicular to each other and to k, 
and C*(k, r) is the creation operator for a photon with momentum #k and 
polarization e, ,: 


[ C(K, r), Crk’, r’)] = (27)°6°(k — k’)6,,. (6.217) 
The Hamiltonian is 


+5 hckC *(k, r)C(k, r). (6.218) 


H, = 
= yr 


To find the photon-electron interaction we replace the operator* P by 
P + eA(x) (the charge of the electron being —e) in the single-particle 
Hamiltonian 


P2 
HP =— + V, 
2m 


so that 
2 
HY + HM) = (P + eA)” +V, 
2m 
1) € e” 2 
= HS) + — (P-A + A: P)+— A’. (6.219) 
2m 2m 
Writing 
A(X) = | atracoer<el (6.219) 
we have 
2 
H® = — | d?xace jx) + | dPxaCetay<a (6.220) 


* P and X are the electron momentum and position operators, and A(x) operates on 
photon states, so that A(X) (see Eq. (6.219’)) operates on both photons and electrons. 
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where 


iQ) 


- = (Pix)<x| + |x)<x|P) 
m 


— © ifvle)Xx1 — eCWCaD)] (6.221) 


Note that j“(x) is the charge —e multiplied by the probability current-density 
operator. The expectation value in any one-electron state |) is 


WOW) = - a [W(x VW*(x) — W*(X)VY(x)]. 


By inspection of Eq. (6.220) we have, for the interaction between photons and 
systems of arbitrary numbers of electrons, 


H,, = — | d? xA(x) -j(x) + < | d? xA(x)?a*(x)a(x), (6.222) 
where 


j(x) = - he i[Va*(x)a(x) — a*(x)Va(x)] (6.223) 


is the electromagnetic current-density operator. 

In Eq. (6.222), A(x) is itself an operator for each x, given by Eq. (6.216). 
If we use Eq. (6.216) in Eq. (6.222), and also express a“ (x) in terms of a} using 
Eq. (6.195), we get terms involving 


C*(k, r)afag, Ck, r)afas, Ck, r)C(k', r')azas, Clk, r)C*(k’, r')az ag, 


and so forth, which have interpretations similar to those of the phonon- 
electron interaction. 

The electromagnetic current defined in Eq. (6.223) may be expressed in 
momentum representation as follows: | 


. d3p’ d’p p+ eee 


This expression will be useful in Section 6.12. 


6.12 FEYNMAN DIAGRAMS 


A graphical method employing what have come to be called “‘Feynman diagrams”’ 
has proven to be very convenient in dealing with perturbation solutions of 
complicated Hamiltonians. These diagrams serve as a ““bookkeeping”’ device 
to keep track of all the perturbation terms and a guide in writing down the value 
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of each term. (The diagrams assume their full power in the relativistic case, 
with which we are not concerned here.) To show how the method works we 
will consider the Hamiltonian for a single, otherwise free, electron interacting 
with the electromagnetic field: 


H = rec + Aint, 


where 


_ (4p Po Ok SS Chath, P 225 
wo = [ERE arate) + | Sos Bea NA) (6225) 


and 
Hw = — {a xj(x) - A(x) + [as = a*(x)a(x)A(x)*. (6.226) 


Here we denote the photon-creation operator by A*(k,r) instead of by 
Ct(k, r): 


A(x) = oie —— [A(k, rje*’* + At(k, rye“ *Je,,, (6.227) 


an 2 


and j(x) is given by Eqs. (6.223) and (6.224). In this section, #7 = 1. 
The states under consideration are of the form 


Ps Kye Pisce ke (one electron and n photons). (6.228) 


According to standard quantum-mechanical techniques, the transition 
amplitude M,; for a transition from an initial state |i) to a final state |f) can 
be expressed in terms of the Hamiltonian (to second order) as follows: 


(2n)*5%(pe — p)Mn = <fIHliy + ISIVPAMIEY + OCH). (6.228) 


Cross sections and transition rates are proportional to the absolute square of 


the amplitude: 
Rate = > |[M,;|7(27)*57(p, — p,)O(E, — E;). 
f 


In practice, the states |i) and |f> are usually states of the form given in Eq. 
(6.228), that 1s, states of particles of definite momentum. It is therefore con- 
venient to express H;,, in terms of the A(k, r), substituting Eq. (6.227) in 
Eq. (6.226), and to use Eq. (6.224) for j(x). 


Ain, = Hy, + Hp, (6.230) 
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where 
_ (d°p’ ( dip ( d’k p +p. 1 
od is ps a Lena | tee ek 
x [(22)°6°(p' — p — k)a*(p')a(p)A(k, 1) 
+ (2)°5°(p' + k — p)a*(p')a(p)A*(k, r)], 


and 


, = (ae (ae a las a ae 
(2n)> J (20)? J (2x)? J (Qn) 2m Jock J2ck 

x [(22)953(p’ — p — k’ — k)a*(p’)a(p)A(k’, r’)A(k, r) 
+ (22)353(p! + hk! — p — k)a*(p')a(p)A* (k’, r')A(k, 1) 


+ (2n)°5°(p' + k — p — k’)a*(p')a(p)A(k’, r')A* (k, 1) 


(6.231) 


+ (22)°6°(p' + k' + k — p)a*(p')a(p)A*(k', r’)A*(k, r)]. (6.232) 


Note that the total momentum is conserved. 


Each of the terms in Eqs. (6.231) and (6.232) is represented by a diagram 
in which a straight line denotes an electron and a wavy line a photon. For H, 


the diagrams are given in Fig. 6.2a and 6.2b, and the amplitude for each 1s 


oP tP ° Chor a 
2m /2ck 


For H, the diagrams are those shown in Fig. 6.3a through 6.3d, and the 


amplitude for each is 
e? 1 bien aie 
SS PS te een eee k’,r’ k,r° 
2m x 2ck's/ 2ck 


(p+k= p’) . (p=p’ +k) 


(a) (b) 
Fig. 6.2 Feynman diagrams for A. 
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(p+k+k’=p’) (ptk=p' +k’) 
(a) (b) 


(p+k’=p’ +k) (p=p’'+k+k’) 


(c) (d) 
Fig. 6.3 Feynman diagrams for H>. 


Note that parts b and c of Fig. 6.3 represent essentially the same process. In 
fact, after the integrations are performed, the second and third terms of 
Eq. (6.232) are equal;* so in any process involving such terms we need only 
calculate for the case shown in Fig. 6.3b, say, and multiply by 2. 

Now suppose we have a definite amplitude to calculate, for example, that 
for Compton scattering. The whole process is denoted by Fig. 6.4. To find the 


P, ky, ry 
(p, +k,= p.+k,) 


Fig. 6.4 Feynman diagrams for Compton scattering. 


* We ignore the infinite self-energy of the electron that comes from rearranging the 
A and A* of the third term in Eq. (6.232). 
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(a) (b) 


Fig. 6.5 Feynman diagrams with the same ingoing and outgoing lines as Fig. 6.4. 


amplitude we draw all diagrams having the same ingoing and outgoing lines as 
that of Fig. 6.4, using those of Figs. 6.2 and 6.3. Thus we get the three cases 
shown in Fig. 6.5a, b, and c. Parts 6.5a and b correspond to the second-order 
term of Eq. (6.229), in which the intermediate state |n) consists of one electron 
and H = H, + O(e’). (The fact that the photon lines cross in Fig. 6.5b is of 
no significance.) Using Eqs. (6.229) and (6.231) and removing the factor 
(22)°5°(p, + ky — p; — k,), we get for these contributions to the amplitude 


€ Pot pth, - € py t ky + py . 
Ck2,r2 Ck yrs 


yi — \W2cks 2m J 2ck, 2m 
= 2 2 
Pi + ck, = (Pi a ky) 
2m 2m 
(6.233) 
and 
( é Ptp-h., )( é P-~k+m., 
cate Kir k2,r2 
M® = V2ck, 2m J/2ck; 2m 
aa 2 = 2 
Pig ox, — Pia) 
2m 2m 
(6.234) 


We may associate the two factors in the numerator with each vertex, and the 
denominator with the “propagation” of the intermediate electron. 
Figure 6.5c corresponds to the first-order terms of Eq. (6.229) in which only 
H, can contribute. Remembering the factor of 2 due to the two ways in which 
this process occurs in Eq. (6.232), we get 
MoO =2- e ! 


Se Ck2,r2 ; C ksi (6.235) 
2m ,/2ck,J/2ck, 
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The total amplitude for Compton scattering is 
M = M® + M® 4+ M© + O(e*) (6.236) 
In equations (6.233) and (6.234) 
Pi tk =p. +k, and Ki Gy, = hr ey, = 0. 


In fact, if the initial electron is at rest (i.e., p,; = 0), then M® and M® both 


vanish. 

The results we have derived are satisfactory for low photon energies 
(ck, « mc”). For high energies we would have to use a relativistic theory and 
also take into account the spin of the electron. 


CHAPTER 7 


SPIN WAVES 


7.1 SPIN-SPIN INTERACTIONS 


In this section we will consider interactions that give rise to a coupling between 
the spins of two objects of the form 


H x S,° S$. (7.1) 


In later sections we will find out what happens when such a coupling appears 
in a lattice full of spins. 

An interaction of the form of Eq. (7.1) can be due, first of all, to the magnetic 
interaction between two magnetic dipoles. The interaction between two magnetic 
dipoles yz, and py with separation r is 


H = bea — 3A Ma") Ue = | ~ TH 1,5") = H' +H", (7.2) 
r 

where H’ is the term in brackets. (The term H” with 6°(r) arises from a corres- 

ponding term in the magnetic field of a dipole, which appears when one evaluates 

V x (@ x r)/r?). 

Suppose the two particles are orbiting around each other (as in an atom) 
and we consider the matrix element of H between the two states of orbital 
angular momentum / and /’. As a function of the direction (0, @) of r, H’ is a 
combination of spherical harmonics of order 2; therefore the matrix element is 
zero unless |/ — 2| < /' < / + 2. In particular, the first-order splitting of the 
S-state (J = I’ = 0) is zero. (This argument is just another way of saying that S 
states are spherically symmetric and H’ averaged over all directions is zero). 
However, H” contributes only for S states, because only those have nonzero 
wave functions at r = 0 (a state with angular momentum / varies as r’ near the 
origin). Thus, for S states we have an interaction proportional to py ° My, 
which is proportional to S, - S, because 


ee gS, (7.3) 
2m;c 


for a particle with mass m,, charge qg; and g-factor g; (Eq. (7.3) in fact 
defines g;)). 


198 


ya | Spin-spin interactions 199 


In addition to direct magnetic interactions, a term of the form of Eq. (7.1) 
can appear indirectly as a result of spin-independent interactions combined 
with the exclusion principle. As an example, consider the hydrogen molecule; 
take the nuclei to be fixed and ignore their spins. The two electrons may be 
described by their positions x, and x, and some component of their spins S, 
and S,. If w, and w, are energy eigenfunctions for a single electron with energies 
E, and E,, then 


Wapl(X1, X2) = Wa(X1)W4(%2) (7.4) 


has energy E, + &,, neglecting mutual interactions between the electrons. The 
state W,(x,, X.) = W,(x,)W,(x2) has the same energy. 

If we add a mutual interaction V(x,, x.) = V(x2, x,) which is spin- 
independent between the electrons, then the two-electron eigenfunctions become 


1 
Ws = V2 (Wap + Wha)s 
; (7.5) 
Wa = —= (Wan — Wa)» 
V2 
with their respective energies 
E;=E,+&£,+I1—- J 
(7.6) 


E,=£&,+8,+1+ J. 
Here 


Lae 
I 


| VV 42x, d2x, = | VeVi 42x, dx, 
and (7.7) 
| WS Vibes @°x, dx, = — | WEV ay 42x, dx, 


~ 
I 


As implied by the notation, W, is symmetric and w, is antisymmetric under 
exchange of x, and x. 

We have not yet reckoned with the electron spins or the exclusion principle. 
We must multiply our spatial wave functions by spinors that describe the 
possible spins of the electrons, and the total wave function must be antisym- 
metric under the simultaneous exchanges of coordinates and spins. Now for 
two spin-} particles, the symmetric spin states have total spin 1, and the anti- 
symmetric states have spin 0 (see Section 7.2). The total wave functions are 
then of the form 


Vs = Wx, v= Wa (7.8) 


where x is any two-electron spin state with total spin s. (Note that S and A 
still refer to only the spatial symmetry). In other words, the energy eigenstates 


200 Spin waves 


are also eigenstates of the total electron spin. We can then write the energy in 
terms of the spin as follows: If S is the total spin, S-S has eigenvalues 
s(s + 1) = Oand 2. The energy of a state with total spin s can be written in the 


form 
E = Es + (E, — Es) es” (7.9) 


because it equals E; when s = 0 and E, when s = 1. Therefore we can write 
the mutual-interaction Hamiltonian for the system in the form 


H = E, + (E, — Es) m2 (7.10) 
Now, 
S:S = (S; + S,):(S, + S,) = 34 2S, ° 8, (7.11) 
because 
SS, = 4G + D = 
Therefore 
H = Ey + (E4 — Es)(4 + S,° S2) 
= (Es i 3E4) a5 (Ey, = Es)S, - S, 
= E, + E, + I + 4J + 2JS, x S,. (7.12) 
In other words, the splitting of the energy levels is described by 
AH = 2JS, - S, (7.13) 


which is of the form of Eq. (7.1). We have thus taken a spin-independent inter- 
action (e.g., a Coulomb potential) between the electrons and made it look like a 
spin-spin interaction. Note that the splitting is proportional to the “exchange 
integral” J defined in Eq. (7.7), which depends on how much the wave functions 
W(x) and w,(x) overlap. Note also that the derivation of Eq. (7.12) depends 
crucially on the fact that electrons have spin 4; for higher spins we would get a 
polynomial in S, - S, if we went through a similar procedure. 

The interaction of Eq. (7.13), due to the exchange integral, turns out to be 
very strong compared to the usual magnetic-dipole interactions. 


7.2 THE PAULI SPIN ALGEBRA 


It is convenient and conventional to use the Pauli spin matrices when dealing 
with a particle of spin 4: 


v= (1 = (j “a =(¢ i): (7.14) 
1 0 i 0 0 -1 


The spin is then 
S =e (7.15) 
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where 
o = 0,€, + de, + 0,e, (7.16) 


and where we take f# = 1 (otherwise S = ha/2). Equation (7.14) implies that 
we are using a basis of states in which the z-component of the spin is diagonal- 


ized; the states 
la) = (5) and |B) = (‘) 7.17) 


are states of spin up (S, = 4) and spin down (S, = —4), respectively. From 
Eq. (7.14), we have 


o,|4) = |B), — ayla> = iB), o,|%) = |x), 


(7.18) 
o,|B) = |x),  o,|B) = —ila),  9,|B) = —|B). 
The algebra of the o-matrices is as follows: 
| 0,0, = —0,0, = i0,, (7.19) 
and cyclically in x, y, z 

0,0, = 0,0, = 6,6, = 1. (7.20) 

Alternatively one may use the following elegant form of Eq. (7.20): 
(a:a)(b:-c) = a:b + ia x b-a, (7.21) 
where the a; and b; are numbers (or other operators, provided that they commute 


with oa). 
Consider next a system of two spin-4 particles (ignoring spatial and other 
degrees of freedom). As a basis of states we may take 


lax>, |aB>,  |Ba>, — BB), (7.22) 


where, for example, |«f) is the state in which the first particle has its spin up 
and the second has its spin down. We then define a, to act on the first spin and 
ao, on the second. That is, for example, 


J, |aa> = i|Ba>, 
Orylaa> = ilaB). 
Note that o, 6; = @;0,;, which is to say that operators pertaining to different 
particles commute. 
We have previously considered i (and will consider further in the section to 
follow) the operator S, -S, = 40,-0,. By explicit calculation, we find 
d,°o,\4%)> = laa), 
d,°6,|08> = 2|Pa> — |aB), 
0, °6,|Bu> = 2|aB> — |Ba), 
a,°6>|BB> = |BB). 


(7.23) 
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Writing |aw> = 2|au> — |a«) and similarly for |BB), we immediately come upon 
the useful relation 


61°06, = 2p” — 1, (7.24) 
where p'’? is the operator that exchanges the spins of the particles; that is, 
p’’?|aa> = jaw), p'*|aB) = |Ba), and so on. 


The symmetric states 


11,1) = ax), [1,0) = ie eed I, —1) = [BBY (7.25) 


all satisfy p*’*|1,m> = |1,m) and therefore o,-o,|1,m) = |1,m>) (m = 
1,0, —1). The antisymmetric state 

10,0) = 1B? — |Ba) (7.26) 

V2 
satisfies p'’?|0, 0) = —|0, 0) and thus o,-o,/0,0) = —3{0,0). Therefore 
the eigenvalues of o, -o, are 1 and —3. Now the total spin is 
S = 46, + io, (7.27) 

and its square is 

S:-S = 3 + 4o,- a4. (7.28) 


Therefore 
S- S|1,m)> = 2|1, m), S- S|0,0> = 0, 


so that (as we anticipated in the notation) the states |1, m)> have spin 1 and 
|0, 0> spin 0. The fact that the symmetric states have spin 1 and the anti- 
symmetric ones have spin 0 may also be seen directly from the relation 


S:S=p'* +1, (7.29) 
which is derivable from Eqs. (7.27) and (7.28). 


7.3 SPIN WAVE IN A LATTICE 


Consider a lattice, the ith member of which has spin o;. The interaction between 
the spins is due to the exchange effect, and the Hamiltonian is given by 


H = > A; ;6; ° GC ;- (7.30) 


tJ 
Consider the lattice to be cubic, and let each point on the lattice be denoted 
by the integer vector N = a(n,e, + nye, + n,e,), where a is the lattice spacing. 
Then 


H= Ay n’On * On’. (7.31) 
N,N’ 
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It is clear that the interaction depends only on the distance N — N’ = M, 
and if in addition it is assumed that only nearest-neighbor interactions are 
important, 


H = » A, (on * ON +ae,. + on ° ON +aey + on * ON +ae,): (7.32) 


Occasionally next-nearest-neighbor interactions are important. They may 
be even more important than the nearest-neighbor ones, but such a situation 
can be analyzed in the same way that we use here. 

We now return to Eq. (7.32) and consider the one-dimensional case. The 
extension to three dimensions will be immediate. We also are interested in the 
ferromagnetic case (A, < 0), and we can neglect boundary effects by con- 
sidering the system to be a closed loop. Let there be N lattice points; gy4, = ¢,. 
Thus, 


H = —A pd 0, ° On+1 (7.33) 


where A = —A,. The minus sign in front of A is included to emphasize the 
fact that in the case of ferromagnets, —A < 0. That is, the energy is most 
negative when the spins are parallel. The (rare) instances when —A > 0 are 
due to a double-exchange effect. Recall that we used « to represent spin up and 
B spin down, and we defined the spin exchange operator p’’? = 4(1 + 6, - 4), 
satisfying 


p'’*|au> = laa, 
p’’*|BB> = |BB), 


p’’*|aB» = |Ba, (7.34) 
p'**|Ba> = |aB). 
Then 
H=-—A y (2p™"*! — 1) = NA — 2A y pent, (7.35) 


To solve Schrédinger’s equation with the above Hamiltonian, we must find the 
eigenstates and eigenvalues of H. To do so, notice first that Eq. (7.34) implies 
that H operating on a state with a fixed number of f’s gives a state with the 
same number of f’s. Since all states can be represented as linear combinations of 
those with a fixed number of f’s, we have a method of partially diagonalizing H. 

Rather than trying to completely diagonalize H, let us merely find eigenstates 
with low energy. 

One suspects that the lowest state (most negative energy) occurs when all 
the spins are parallel. For example |a«--- «a> is such a state. More generally, 
when we say all spins are parallel what we really mean is that the total spin is as 
large in magnitude as possible, and it can be shown that the states that satisfy 
this condition are completely symmetric states. The completely symmetric 
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states are invariant under p""**; so p""*+ can be replaced by 1 in computing the 
energy of those states. Thus, we expect that 


Egna = (completely symmetric spin state |H| completely symmetric state) 
N 
=NA-2A)1=-NA 
n=1 


is the lowest-energy state. That this is the lowest energy is obvious. The 
eigenvalues of p”"** are +1; thus the minimum conceivable energy can occur 
only when for all n we get the maximum possible expectation value for p""*?, 
that is, when we can replace p""*! with +1, 


Problem: If —A > 0, what is the lowest state and what is the lowest energy? 


In order to take the ground state as the zero of the energy spectrum, define 


N 
H' =H + NA = —-2A ¥ (p™"*! — 1). 


n=1 


One may suspect that the next-to-lowest states has all but one spin parallel. 
But which one? It is clear that 


lWi> = |Bax---a)>, |2> = |aBa--- a>, 
lV, = la: aBa---a) (6 in nth place) 


are equivalent and that none of them can be an eigenfunction. We look for a 
superposition of the |y,). Let 


(7.36) 


W> = da, War. (7.37) 
To solve the equation H'|w) = El), 
Y (Ea,)\W.> = Ely> = H'|b> 
= =9A y A, y (pe! — 1lv,) 
lala 3 anl(p™"** — 1) + (p"*" — Ilys, 
= -24 ¥ ayllor1) + le) — ad 


Because the |) are orthogonal, we can equate coefficients of |) to get 
Ea,, — —2A(G,41 + Qan-1 = 2a,,). (7.38) 


To solve this set of equations, let a, = e’*" and substitute into Eq. (7.38). We 
thus obtain 


E = —2A(e® — 2 + e~) = 4A(1 — cos 6) = 8A sin?” -. (7.39) 
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Fig. 7.1 Energy as a function of 9. 


Neglecting boundary conditions as we have done is tantamount to assuming 
that the N lattice points are arranged in a circle, or that the N + 1 point is 
the first point. In other words, 

Qye, =, or eMNFD = el or  eiNd = |, 
This means that 6 = (2z/N)J/; / an integer; 


—N/2 <1 < N/2. (7.40) 


We have, from Eqs. (7.39) and (7.40): 
Pagina ease! © (7.41) 
2 N 2 


This function is plotted in Fig. 7.1. There is, in other words, a band of energies, 
the minimum energy occurring at 06 = 0. If 6 = 0 we have a completely sym- 
metric state, which we have already concluded has the lowest possible energy, 
E = 0. Another way of seeing that the state with 6 = 0 has the same energy as 
lax ---a» is as follows. Let each spin be tipped from its “‘up” position by an 
angle 0, where 6 is very small. Such a state is given by 


ee Sue eee Se a 
2 Z Z 2 


Because 0/2 is small, cos 6/2 ~ 1 and sin 6/2 ® yn « 1 (y = 0/2). Then 
Ix? = I@ + nBYa + nB---)). 
Neglecting terms in 4* or higher powers of 7, we have 
l~> = law--:> + n(|Baa---a> + laBu---a0> +--°). 


Subtracting out the ground-state term, we obtain |y’>) = >, n|W,)> where |W,» 
is given by Eq. (7.36). Since |y> must be an eigenvector of H with the same 


eigenvalue (energy) as |«, «,..., «),* so must |x’), and (1/n)|y’> = >, |v). 


In = 


* Since in the absence of external interaction there could be no change in energy when 
all of the spins are rotated in a certain angle. 
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For 6 # 0, we see that the spins are out of phase. This suggests the concept 
of a wave, and we now rewrite e™" to make the nature of the wave somewhat 
clearer. 

The one-dimensional integer “‘vector” is na, where a is the lattice spacing. 
We can write a, as a, = e'@" = e'*@"), which describes a plane wave of wave 


number k. Such waves are known as “spin waves.”” Thus, 


PeWia<cgska. tele. (7.42) 
a Na 
For small ka, which means long waves, cos ka ~ 1 — k*a?/2, and 
E = 2Aa’k? (long wavelength). (7.43) 


The extension to three dimensions gives us no trouble: The lowest state again 
has all spins parallel. Subtracting out the ground energy, we obtain 


H’ = — »» Anu(On* Onim — 1) 
N,M 
and, in a manner similar to that used in the one-dimensional case, 
> = Dek My) and E= —¥ Ay(e™™ — 1). 
N M 


For nearest-neighbor interactions (in a cubic crystal), 


E= —2A{(e'*=4 = 1) 3 (e7 'Kxa = 1) a (eK _ 1) Ho as +} 
= 8A Jsin? aia + sin? sis + sin? aad , (7.44) 
2 2 2 
For long waves 
E = 2Aa*K* = K?/2n, (7.45) 
where 
1/2u = 2Aa?. 


To find the next band of energies, we must consider the case when all the 
spins but two are parallel. But first we will digress to discuss the semiclassical 
interpretation of spin waves. Problems dealing only with spins have often an 
immediate semiclassical interpretation. Let us look at the spin wave case. 


71.4 SEMICLASSICAL INTERPRETATION OF SPIN WAVES 


The Hamiltonian is 


N N 
H = —2A oy (pe ~ 1) = —A de (6,,° Ont+1 — 1). (7.46) 
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The Heisenberg equation of motion for the mth spin is 
é, = , (He, — o,H). (7.47) 


We transform this using the commutation relations: 


0,0, — 0,0, = 2io, ete. (7.48) 


so that 


(0, ° 61 )01x a T1,(02 : o,) = (02,5 1x a OryF1y as 02,0 17)01x aa O1x(° : 2) 


= 2i(F,F2y — O10 2y) 


= 2i(o, X 6), (7.49) 
Using Eq. (7.49) in Eq. (7.47), we can write 
he, = 2A(6, X On414 — Gn—-1 X Gy): (7.50) 


We regard this as the classical equation of motion for the vector o,. This 
is a nonlinear equation, but it can be linearized when we consider o,, to be 
close to 1, in which case we can approximate thus: 


NGn x 

he, y 
We expect the vector o, to rotate around the z-axis, and we also expect the 
solution to be of the form 


2A(2¢,,, ™ Unt+itiyy — On—1,y) 
—2A(2o,, —~ Ont+i,x On-1,x)- 


Rw 


(7.51) 


GC, © c sin ate", 


(7.52) 
Ony & c cos wte”™, 
where a is the lattice constant. When we use Eq. (7.52) we see that 
ha = 4A(1 — cos ak). (7.53) 


7.55 TWO SPIN WAVES 


When the spins at positions n, and n, point downward we define the wave 
function @,,,n,: 


IPninr> = |&- +> eBa-- + Ba++ a), (7.54) 


where the B’s are in positions n, and n,. The Hamiltonian is the same as that of 
Eq. 7.46, and the Schrédinger equation is 


H [ly = 4Ae |p; Energy = 4Ae?? (7.55) 
where 


Wy» re yy An, nal Pnino>* (7.56) 
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When Eq. (7.56) is used in Eq. (7.55) and the coefficients of ¢,, ,, are compared 
we can distinguish two cases. When n, 4 n, + 1, in other words when the 
‘two down spins at 1, and n, are nonadjacent, we have 


2EAy, ng = (Qn, ng a An, —1,n) a Ce ~~ An, +1,n2) 


+ (Qin = Oi ii) zo (Gans ~~ Qn, nz+ 1): (7.57) 


On the other hand, when 1, = n, + 1 the two down spins are next to each 
other and Eq. (7.57) does not hold. We then have 


2EQn n+ 1 oa (Gantt ic An—1n+1) ate (Gaint4 = Ann+2)- (7.58) 
Before solving Eqs. (7.57) and (7.58) rigorously let us derive some approx- 
imate results. Because the case n, = n, + | is only one out of N (the number 


of lattice points), for most of the time Eq. (7.57) holds. Therefore, the first 
crude approximation should correspond to two noninteracting waves: 
Ann, = e 


oe efk2nag, (7.59) 
When we use Eq. (7.59) in Eq. (7.57), we see that 
e) = (1 — cos k,a) + (1 — cos ka) 
e(k,) + &k), (7.60) 
where 4Ae(k,) is the energy for one spin wave of wave number k,. 


At low temperatures, only the lowest energy modes are excited; thus for 
one spin wave, the energy might be, 


2 
p= ev), 

Qu 4Aa? 
kt, ke 


Qu Ow 


We can form wave packets with group velocity dw/dk, where E = hw and the 
phase velocity is w/k. The statements above bring out strikingly the similarity 
of spin waves to crystal vibrations or phonons. By analogy to the concepts of 
“phonon,” “photon,” and so on, we call a spin-wave excitation a “magnon.” 
Since the approximate system discussed above is made up of independent Bose 
particles, we can use the expressions that we discussed before. The free energy 
Fis 


ikinya 


and for two spin waves 


E= 


F=kT {Ind —e )—; - (7.61) 
(27)° 
For low temperatures only the E(k) near the bottom are significant, so that we 
may approximate 


E(k) & k?/2p, 


7.6 Two spin waves (rigorous treatment) 209 


ko 


Scattering 


Fig. 7.2 Two spin waves, or the scattering of two quasi-particles. 


and 
ro) 2 
F=kT In(1 — e> ee) 4nk" dk (7.62) 
‘“ (27)? 
The energy is 
00 2 2 
7 k* 2u 4nk* dk (7.63) 
ek? /2ukyT =| (27) 3 


This energy depends on T to the 5/2 power: 
Uc Tr. 

Therefore the specific heat is proportional to the 3/2 power of T: 
Cor, 


This result for spin waves was derived by Bloch. 
Although the system is made up of ideal Bose particles, Bose-Einstein 
condensation does not occur because the number of particles 1s not fixed. 


Problem: When there is an external magnetic field, derive the energy of the 
spin waves. Find the magnetic susceptibility. 


7.6 TWO SPIN WAVES (RIGOROUS TREATMENT) 


Bethe treated the two spin waves in a linear system rigorously.* When there 
are two spin waves in a system, the problem can be looked upon as a scattering 
of two quasiparticles, as in Fig. 7.2. For the scattering process the energy and 
the momentum are conserved, and the eigenstate is a superposition of the 
incoming and outgoing waves: 


a = qet(kinia + kana) ae Belk2mia + kinza), (7.64) 


ny1,nm2 


From this point onward we will let a = 1. That is, for convenience we are 
either assuming unit lattice spacing or absorbing the factor ain k. As mentioned 
earlier, 1, < n,. The ratio «/f of the coefficients is related to the phase shift 


* See A. Sommerfeld and H. Bethe, Handbuch der Physik, Vol. XXIV/2, p. 604 (1933): 
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resulting from the scattering, and it is to be determined from the scattering 
equation. 

The left-hand side, a,, ,,, of Eq. (7.64), must satisfy Eqs. (7.57) and (7.58). 
First, when a is substituted in Eq. (7.57), we see that 


e?) = (1 — cosk,) + (1 — cos k,). (7.65) 


This equation has the same form as Eq. (7.60), although we expect k, now 
satisfies a different relation from that found in Section 7.1. 

Next we can use Eq. (7.64) in Eq. (7.58). Rather than doing that immediately, 
notice that Eq. (7.58) is a special case of Eq. (7.57) if we define a, , (which was 
left unspecified before) by 


ny,N2 


2Ann+1 a Qn n a Qn+1,n+1 (7.66) 


Then, substituting Eq. (7.64) in Eq. (7.66) is equivalent to substituting Eq. (7.64) 
in Eq. (7.58). From Eqs. (7.66) and (7.64) we have 


2(ae**2 ee Be'*) =-oa+ B ol qeilki +k2) at Bellhs tha) 
It follows that 


OL 1 xs eilki +k2) 2 Jeik 
B = 4 4 gilkitka) _ Qoika 


_ COs (k, + k,)/2 — gilki—k2)/2 


= — see 7.67 
cos (k, + k2)/2 — ei)? ee 
If k, and k, are real, 
es where z= ogg nn — jeilhi~ka)/2 
Bz" 
Therefore |a/B| = 1, and in fact 
a/p = e’? (7.68) 
where /2 is the phase of z: 
2 Imz_ cos(k, + k,)/2 — cos (k, — k,)/2 
ane sot _ ee ; 
2 2 2 
or 
2cot ? = cot fis cot K2 (7.69) 
2 Z 2 


We will also consider cases in which k, and k, are complex. Then |a/P| is no 
longer 1, but g may still be defined by Eq. (7.68) (@ will now be complex). 
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Equation (7.69) holds, for such cases, as an algebraic identity or by analytic 
continuation from the case of real @. 

For scattering problems we assume k, and k, to be real. Then |a/f| = 1, 
as is expected, and g, which is real in this case, is the phase shift after the 
scattering. One special solution is 

a = evi? 


B = ev i#l2 (7.70) 
From the substitution of Eq. (7.70) in Eq. (7.64), this becomes 
Onin = eilkimi tkon2+ 0/2) 4) pilkam +kinz— 9/2) (7.71) 


When we impose the periodicity requirement with the period of N, we see that 


Anan +N — Qn, jn" (7.72) 


(Because n, + N > nz, we do not write a, +n, = %,.n.:) Equation (7.72) is 
satisfied when, from Eq. (7.71), 


= 2mm, + 7) 


(7.73) 
Nk, = 2mm, — 9, 


where m, and m, are integers. It must be noticed here that Nk, is not equal to 
2nm,, aS was the case for one spin wave, but is changed slightly by the amount 
of yg. Using Eqs. (7.69) and (7.73) we can eliminate k, and k,, and determine @ 
as a function of m, and m,. 

It can be shown that the real values of k do not exhaust the possibilities. 
We can examine the case of complex wave number by putting 


Rape te (7.74) 
k, =u — iv. 


Equations (7.70) through (7.73) still hold (although @ is complex). With 
Eqs. (7.74) we can rewrite Eq. (7.73) in terms of u and v: 


T 
u = —(m, + m,), 7.75 
a 1 2) (7.75) 


op = mm, — m,) + iNv. (7.76) 


From Eq. (7.75) it follows that u is real. Using Eqs. (7.74) in Eq. (7.67), and 
(7.76) in Eq. (7.68), we obtain a relation between u and v: 


—v 
eit(m2—m1)g—No = __ COS u __ é 
cos u — e° 


Now, if we hold v fixed and let N — oo, the left hand side — 0; so we have for 
large N 


e °x cosy, (7.77) 
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so that v is approximately real. (We have assumed that Re v > 0, but the 
following result will be the same if Rev <0.) The energy is derived from 
Egs. (7.65) and (7.74); use of Eq. (7.77) leads to 
e?) = 2 — cos (u + iv) — cos (u — iv) 
= 2(1 — cos u cosh v) 
sin? u = 4(1 — cos k), (7.78) 


2 


where we have defined 
k= 2u=k, + ky. (7.79) 


To get a,, ,, we use Eqs. (7.74) and (7.76) in Eq. (7.71). The result is (with a 
change in the normalization) 


N 
Anny = el tndkl2 cosh v (7 +n, — ma) 
oF " (7.80) 
Gg _ gilt +n2)k/2 sinh (F + ny _— ra) 9 


according to whether m, + mz is even or odd. a,, ,,, has a sharp maximum when 
Ny = Ny (or ny = n, — N) and decreases as n, — n, increases (remember we 
assumed n, S n,). 

Our solution represents a bound state. What is the binding energy for this 
quasiparticle? When there are two independent spin waves both having wave 
number K, the energy is 


e = 211 — cos K) = K? — K*/12 +--- (7.81) 
For a quasiparticle of wave number 2K, we find from Eq. (7.78) the energy 
e = 4(1 — cos 2K) = K* — K*/3 +-:-. (7.82) 


Equation 7.82 gives a lower energy than Eq. (7.81) does, but the difference is 
very small, namely, K*/4. In other words the binding is very weak, and it 
depends on K in such a way that the long-wavelength spin waves do not interact 
much. This is the reason why the approximation of non-interacting spin 
waves is a good approximation for low temperatures. 


7.7 SCATTERING OF TWO SPIN WAVES 


In this section and the one that follows we will approach the two-spin-wave 
system in a slightly different way. In Section 7.6 we found the eigenstates of the 
Hamiltonian containing two spin waves. Here we will consider the simple waves 
found in Section 7.5, which are not eigenstates of H but describe pairs of 
independent spin waves. Then we will apply perturbation theory to calculate 
the amplitude for scattering of these waves. 
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The general Hamiltonian for spin waves in three dimensions is written as 


H = — Y, 2Ay(ph-™ — 1); (7.83) 
M,N 


where A,, is a constant. For the one-dimensional, nearest-neighbor exchange 
interaction case, we may write 


H = —24¥ (p™"*! — 1). (7.84) 


The scattering of two spin waves will be described in this section for the one- 
dimensional case, although a similar analysis holds for the case of three dimen- 
sions. 

We know that a ground state of Eq. (7.84) is 


lp) = |aw--- a), (7.85) 
for which all spins point up. When one of the spins points down, 
1 ikn 
Wo = = Le™lg,), (7.86) 


JN 
is an eigenfunction of Eq. (7.84) with the eigenvalue 
E, = 4A(1 — cos k). (7.87) 


When there are two spins pointing down, 


1 ikyn, ,ik2n2 
Wissko> ee Sy ei tthe lPning> (7.88) 


JN? nyn2 


describes a state in which two spin waves exist. However, Eq. (7.88) is not an 
eigenfunction of Eq. (7.84), as we saw above, and the two spin waves scatter 
each other. Let us examine how the two spin waves interact. By definition, 


lPninz>” = |Pngn,) iS the state where spins are down at n, and n,, and |@,,,,) = 0. 
We want to evaluate 


VN? HW = ¥ ee” HI0,,.,)- 


nin2 
Here 1|9,,,,,> 18 calculated for four cases: 

1) n, and n, are separated, that ism. # ny,n. An, + In, 4 n, — 1. 
li) m2 =n, + 1. 
lll) m2 =n, — I. 
iv) Ny = n,. In this case |9,,,,) = 0. 


For case (1), 


A|Pn.n>> = —2A[ (On, 41,02) — lPnina>) + (Op. aii) — (>) 
+ (lPn,.n2+1) o lPnina>) + (ln, .ng-1> a IPning>)}- (7.89) 
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For case (ii), 


A @nn+1> = — 2A[(l@n-1.n4+1> _ lPnn+1>) a (lPnn+2> 7a lPnn+1>)1- 


In order to conform with Eq. (7.89) we will write the equation above in the form 


A@nnti> = —2A[-2|@nn+1> + (Opi 534) = lPnn+1>) 

+ (lQnn+2> = lPnnt+1)) = 4A Pant 1) (7.90) 
where the term in square brackets is Eq. (7.89) with n, = n, + 1. Similarly, 
for case (iii), 

A\@yn-1) = —2A[ —2|@nn-1> (Op 20) = lPnn—1>) 
+ (l@n+1.n-1) — 1@nn-1))] — 44l@an-1>- (7.91) 
For case (iv), 
H|9,,,) = 0 
—2A[@n+ 10>? + 1Pn-140) + [Pnnti? + lPnn—1>] 
+4A[|On n41) ay lPnn—1) I (7.92) 
Here the first term is Eq. (7.89) with n, = ny. 


When we use Eqs. (7.89) through (7.92) to evaluate J N?.H Wiska>s the 
first terms of Eqs. (7.90), (7.91), and (7.92) combined with Eq. (7.89) lead to 
the sum of Eq. (7.92) over n, and n, without restriction; thus 


JN? AWik. = —2A BS ere er Ee (e7 = 1) 


nyn2 


Pe Sl) (en = 4) 


-_ 4A » ee EY Oi ea) 4. lala is || ae 
N 


= pac (i eee + Pnn-1)] 


= 4A[(1 — cos k,) + (1 — cos ky) VN? Wika + VN2\x). 
(7.93) 


The ‘“‘remainder”’ |x) is written as 


JN? Ix) ev x eilkr + k2)n( pike 4+ eft yf elltatkanig 1) (7.94) 


The first term of Eq. (7.93) is, as seen in Eq. (7.88), a double sum over n, 
whereas Eq. (7.94) is a single sum over n. Therefore, the second term of 
Eq. (7.93) can be regarded as expressing the scattering of the two spin waves 
that appear in the first term of Eq. (7.93). 

Notice that when H operates on |;,,,,» a new state emerges, but the new 
state is still a combination of states with two flipped spins. In other words, 
when two spin waves encounter, two spin waves come out. 
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With the help of Eq. (7.87) we may write Eq. (7.93) as 


A Wass» = (E,, Bs Ex )|Whske> + Ix», (7.95) 
where 
4A 
Ix) = ~ TN2 eee Oe aes (7.96) 
with 
E = efk2 4 iki — 1 — gilt tha) (7.97) 


7.88 NON-ORTHOGONALITY 
The general theory of scattering tells us that the cross section o satisfies 


(number of final states) (7.98) 


a= = [M;i|* x 
h AE 


where v is the velocity of the colliding particles. From perturbation theory, 


_ A A ipolni y .., 
My, = H¢; + >» EE, + (7.99) 
H;, is the matrix element of the perturbation Hamiltonian between the initial 
and final states. In this section the discussion will be limited to the first term 
of Eq. (7.99). 
We are interested in the scattering from the initial state |y,,,,> to the final 
state |W, 1,» where 
Ey, + Ey, = Ex, + Et,. (7.100) 


Then we cannot use Eq. (7.99) as such, because the states |,,,,)> are not 
orthogonal to each other. This is shown as follows. We compute the inner 
product, using Eq. (7.88): 


I i(kyn n2—Linj- y 
pe een Bae aes 2260, ntl Onin» (7.101) 


IN? seca 
nin2 nyin2 


(Wri talks ko? 


From the definition of |¢,,,,>, we see that 


tf 
CPninglPning> = L when n, = ny #nz= Nn, Or ny =n, #¥ Ny = N}, 


= 0 otherwise, including the case nj = n, = ny = ny. 
(7.102) 
Thus, we can write 


(Wri talWeiks> = re pa a a La + gsr a ial 
nyn2 


re — » eilki +k2-L1—-L2)n (7.103) 
~ 
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In the first sum of Eq. (7.103), n, and n, are summed without constraints, so 
that we see 


2 yo er han te Painal = 1 when ky; = Land k3°=.L; 
= 0 otherwise. (7.104) 


If the second term of Eq. (7.103) were absent, that equation would show the 
orthogonality of W,,,,- Ifk,, k, are not equal to L,, L, but 


(k, + ka) — (L, + L,) = “0” (7.105) 


where “0” denotes any integral multiple of 2x. Then, although the first sum of 
Eq. (7.103) vanishes, the last term contributes —2/N. Thus, |W;,,,,> are not 
strictly orthogonal to each other. 
To calculate the scattering amplitude we consider the first term of Eq. (7.99), 
using for the perturbation Hamiltonian 
H' = H — (&, + &,)- (7.106) 
Therefore, 
M;; iad (Wr \(H ~ (Ex, a Ex5))|Wreska> = CWrit|X)> (7.107) 
where we used Eq. (7.95). Using Eqs. (7.96) and (7.88), we see that 


—4A 


M¢; as 2 3 > ere HOT E Gi \Onn id) 
N n mnyn2 
_ —4A y ef(ks tha L1~ L2)nyg— ik i e iLiye (7 108) 
ON? 


because 


(Pninz|Pnn +1) = 1 when ny, =n, Mg =N +1 or n, =n + l, Ny =n 


= 0 otherwise. 
Further, in Eq. (7.108) 
yy elk tha-Ea-Ta)n — N when k, + k, = L, + L, + “0” 
= 0 otherwise. 
Thus, using Eq. (7.97), we have 


Mg; Jest es (em a e~ 1h2)( elk + elk2 = 1 parr ei (la +k2)) (7.109) 


when k, + k, = L, + L, + “0”. Because of this relation between k and L, 
and also the relation in Eq. (7.100), M,; is symmetric in k and L. 
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When we expand M,, for small k; we see that 
|M|? ~ (ky: k,)°. 


Therefore, unless k, is large, the scattering cross section is small. 


7.9 OPERATOR METHOD 


It is possible to rewrite the Hamiltonian for spin waves in terms of creation and 
annihilation operators. 
Using the spin operators o, and o,, we define a, and a_ by 


6, = iz ae, i, (7.110) 


ho 
ho 


Remembering that 


0 1 0 -i 
o, = and = 7.111 


0 1 0 0 
t= : 5) and ¢c_= ( 5): (7.112) 


It is easy to show that 


we have 


g,0. =}$+ to,. (7.113) 


When |g)» is the “vacuum” state in which all spins are up, we can write 
the state |g, > in which the spin is down at N as 


lpn» = n-|o)- (7.114) 
The state |p, of Eq. (7.86) (generalized to three dimensions) may be expressed as 


1 ik: 
lp.) = (= d ef *on- ive) = a, |Po), (7.115) 
where we define the creation operator of a magnon to be 


at y et Noy. (7.116 
ae ) 


=F 


The destruction operator is then 


i 
Sey ENG (7.117) 
VNN 
Further, |;,,.4,» Of Eq. (7.88) is then 
Wisk? = 9%,4b,|P0)- (7.118) 
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To see that this is correct, remember that 
o_o_ = 0. 
Equation (7.117) may be solved for oy, as 
1 ik: 
Gyn, = — Y* Na, (7.119) 
VN‘ 
The Hamiltonian of Eq. (7.83) includes p’’’*™, which contains oy . 6y+my 
or terms of the form 
g,02 + o,or + a207, 
in which the first two terms can be rewritten in terms of products of two a’s. 
The last term corresponds to products of four a’s because of the result shown in 
Eq. (7.113). 
Now let us find out if the a’s satisfy commutation relations such as 


[a,, a7 | = Oye. (7.120) 


The answer is no, and is related to the fact that the w,,,, are not orthogonal to 
each other. We can disprove Eq. (7.120) using Eqs. (7.116) and (7.117) together 
with the fact that [o,, o_] = @,: 


i a 
[a,, a; ~ WN Da Ne Ton, On2+] 
1 g-ik-L)Ng 
“ye? 
ee = Fe *-DNgy — 1), (7.121) 
N 


because 


Le -i(K-L)'N gS 
N'‘N ; 


On account of the second term in Eq. (7.121), Eq. (7.120) does not hold exactly, 
although the second term is numerically of the order of 1/N when there are only 
a few particles with spin down. 


7.10 SCATTERING OF SPIN WAVES—OSCILLATOR ANALOG 


Figure 7.3 shows two waves of momenta K, and K, interacting and “‘emerging”’ 
with momenta L, and L,. The scattering matrix M,;; is given by 


My, = —24 ¥ ef ML — eM R21 — eM ES), (7.122) 
M 
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Ly Le 


L,;=K,+Q 
Lo=K2—-Q 


K, Ko 


Fig. 7.3 Scattering of two spin waves. 


For small K,, Ky, M,;; ~ K,K, ~ K?. The scattering cross section o is given 
approximately by 

o ~ |M/? ~ K+. (7.123) 
At a temperature 7, 4K?/2u = kT and K* ~ T. From Eqs. (7.122) and 


(7.123), o ~ T*. For ordinary particles, M,, ~ 1; hence o ~ 1, and the virial 
coefficient b, ~ T1/?. In the case of spin waves, however, o ~ JT? and thus 
b, ~ T?!?, (7.124) 
It is plausible to suppose that in a virial expansion we must first consider 
independent spin waves, for the next term, two-particle collisions, then three, 
and so on. Such plausibility arguments can be made rigorous by the use of an 
‘harmonic oscillator analog.” 
In this analog, corresponding to the spin state where the ith spin is down we 
will take the oscillator state where the ith oscillator is excited to the first level 
and the others are not excited at all. Hence, 00100 corresponds to aaBaa, and 


so on. Let ay be the creation operator for the oscillators. That is, ay creates a 
phonon (say) in the Nth oscillator. 


gee 
VN 


Now the real H (for spin waves) acts as follows: 


an = 


Y ate ik, (7.125) 
N 


Hapa) Paani’ (7.126) 


We want now to construct an analog H for the oscillators so that 


(01000) 


H(00100) > (9949): 


Now 


Se 
l 


> [-2A(a;4, 1% — 0t;" 04;) 2A(a;~ 1% — a," o;) | 


— ¥ Auli — aN)oy = V exagay, (7.127) 
N K 
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where 
Ex = > A,(e'X'™ — 1) 
M 


will do the trick as long as only one £ is present. However, consider the 
following 


(ao Pacr) 
H(aaB Ban) > (oceBorBer)” (7.128) 
However, the H defined by Eq. (7.127) acts differently: 
H(001100) > (010100) 
_ (001010) 
v2 (000200) — 
./2 (002000) < 
Note that the arrowed functions in Eq. (7.129) have no analog in Eq. (7.128). 


Therefore we must add terms to H so that these arrowed terms will disappear; 
that is, we want to define a new H’ = A + A such that 


(010100) 
(001010) ° 


Hence the term A that is added to H behaves like 


(7.129) 


H’(00100) > 


A(o100) » ~¥2 (000200). 
— 4/2 (002000) 
Then 
A = > DAL oes 105 1054 1; — OF 108; 4 1%; (7.130) 


is the new term we need. For example, «;, ,«; annihilates two adjacent phonons 
at iandi + 1, and o;,,a;,, creates two ati + 1. 

H' = H +A defined as above is not Hermitian, and terms such as 
a;", ,0;, a, must be added. H is an exact analog for the real H if states with only 
one f are considered; H’ is exact if states with one or two f’s are considered; 
and so forth. 

Note that the state (00200) may contribute an energy E, such that e~**' is 
not negligible, in which case the e” °*' term must be subtracted out when forming 
the partition function, e~ *". 


Problem: Using a,’s and «;;’s which satisfy anticommutation relations will solve 
the problem of states with more than one “particle.”” What are the difficulties? 


CHAPTER 8 


POLARON PROBLEM 


8.1 INTRODUCTION 


An electron in an ionic crystal polarizes the lattice in its neighborhood. The 
interaction changes the energy of the electron, and furthermore, when the electron 
moves the polarization state must move with it. An electron moving with its 
accompanying distortion of the lattice has sometimes been called a polaron. 
It has an effective mass higher than that of the electron. We wish to compute 
the energy and effective mass of such an electron. 

Even without any vibration, there is a (periodic) potential acting on the 
electron. This potential will be approximated by assuming that its presence 
causes the electron to behave as a free particle but with an effective mass, M. 
Hence the energy of the electron when there are no lattice vibrations is P*/2M, 
where P is the electron’s momentum. 

It will be recalled that the dispersion in a crystal is as shown in Fig. 8.1. 

We will be interested in the region described by the almost constant part of 
the optical branch, that is, the branch for which the positive ions move in a 
direction opposite to that of the neighboring negative ions. 

If we consider the permittivity as a function of frequency (Fig. 8.2), the 
portion of the curve that will interest us is that part where the lattice vibrations 
are important. 


a Optical branch 


K 


Fig. 8.1 Dispersion in a crystal. 
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+— Lattice vibrations 


Fig. 8.2. Permittivity as a function of frequency. 


Let us take the crystal lattice as being a continuum with a unit cell at each 
point x. The lattice vibrations are quantized according to Section 6.10, where 
the transition from a discrete lattice to a continuous one is made according to 
Eq. (6.201) of that section. Before writing down the Hamiltonian let us consider 
which vibrational modes actually interact with the electron; the others may be 
ignored in this problem. 

Using the notation of Section 6.10, we assume that the electron’s potential 
due to an undisturbed lattice is either zero or else is taken into account by 
reducing the mass as already described. If the lattice is displaced, the potential 
AV, (x) felt by the electron is due to the charge density p(x) resulting from the 
displacement, and this in turn 1s due to the polarization P(x): 


V2 AV,(x) = ep(x) = —eV: P(x) (8.1) 


(where the electron has charge —e). Assume that the crystal lattice has a 
positive ion and a negative ion in each unit cell. There are then six modes of 
vibration for each wave number K. In three of the modes both ions move the 
same distance in the same direction; as K — 0 these modes approach rigid 
translations of the whole crystal and w(K) — 0. Such modes do not contribute 
much to the polarization P(x). In the other three modes the positive and negative 
ions move in opposite directions in each unit cell, producing a polarization that 
is proportional to the amplitude of the mode. For these modes the frequency 
approaches a nonzero value w as K — 0, and we shall make the approximation 
that w(K) = for all K. That is, we assume that the frequency is independent 
of the wave number. | 

Let a*(K, a) be the creation operator for phonons. By analogy with 
Egs. (6.200) and (6.202) (in which A*(k, a) is the creation operator for phonons) 
we see that the polarization (which must be proportional to the displacement) is 
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of the form 
aK 3 ee : eee 
P(x) = « | —— ¥ [a(k, aje'X’*ex , + a‘ (K, aje '™’*ex ,| 
(2n)° a=1 


where «’ is a real constant. The charge density is then 
p(x) = —V- P(x) 


_, (d°K 


3 
= j Y [a(K, ae’ *K- ex, — a*(K, ale *K - ek 4]. 
1 


* | Qn «A 

To a very good approximation we have one e,, which is in the K direction 
(the longitudinal optical mode) and therefore two which are perpendicular to it 
(transversal optical modes). Actually these differ in frequency, and have a 
frequency ratio of (€)/e,,)'/*, where &) and «,, are the static and high frequency 
dielectric constants of the crystals. Only the longitudinal mode, the one with 
(ex ,\|K) contributes to p(x); denote the creation operator for that mode simply 
by ax and ignore the other modes. Thus 


° d°K iK:x + ,-iK:x 
p(x) = ia (2n)3 K[axe — axe I, (8.2) 


so that, from Eq. (8.1), the electron’s potential energy due to the lattice vibrations 
is 


d°K 1 . = 
AV (x) = —ieow’ | —— — Tazel®’* — ate" * 
1(x) (ny al - : ] 
= h>w?> 1/4 d?K 1 ees = 
=a oN age —[age™* — age™'*], (8,3 
i(/ 2x0) ( M ) | io ral K gen" *] (8.3) 


where « is a dimensionless constant. It can be shown that 


2 1/2 
2\E,  E/ ho\ fh 
where é) and €,, are again the static and high-frequency dielectric constants of the 
crystal. In a typical case such as sodium chloride, « is about 5, and in general it 


runs from about | to 20. 
The Hamiltonian for the free electron and lattice is 


Pp? d*k 

Ae T Hose = 2M so ho | Ay Ay, (8.5) 
where we are considering only one electron with momentum operator P, and 
we are ignoring the other lattice modes (which do not interact with the electron 
according to our assumptions). 
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To simplify slightly the calculations of the following sections we shall assume 
the units to be such that A = M = wm = 1. Also we shall sometimes use the 


notation 


d°*K 
= |——; ogg = (22)°6°(K — K’). 
1 Es: xx: = (2m) 5(K — K’) 
With the assumptions made above (due to Froéhlich*), the problem is 
reduced to that of finding the properties of the following Hamiltonian: 


H = $P* + Yagag + i(/2 on)! /2 y= [age * — axe'™*]. (8.6) 
K 


K 


Here, X is the vector position of the electron, P its conjugate momentum, and 
Ay , Ix are the creation and annihilation operators of a phonon of momentum K. 

The polaron problem is not of special importance, but we have now reduced 
it to mathematics. The method we shall use to solve this mathematical problem 
will be applicable to different problems of a similar nature. Our method will be 
valid for arbitrary coupling, but we will first assume small « and will use con- 
ventional perturbation theory to get, first, an answer that we can compare with 
our results for arbitrary coupling, and second, a description of what happens if 


the electron goes too fast. 


8.2 PERTURBATION TREATMENT OF THE POLARON PROBLEM 


We wish to find AZ, the perturbation energy when there are originally no lattice 
vibrations. The Hamiltonian is H = H, + H’ where 


Pp? 

Hy = 2 + DIK aK 

’ 7 |y 1/2 | + -iK:X iK-X 
H’ = i(J2 20) dx (aKe ie | alan) 

K 
A ond n 
AEo = Hoo + Lino po F's (8.7) 
ig Oo” n 


where H),,, = <m|H'|n), |m) and |n) are eigenstates of Hy and E°® is the 
eigenvalue of H, corresponding to |n). 

Because H’ acting on a state changes the number of phonons, we have 
Hoo = 0. We are interested in the energy of an electron traveling with the 
momentum P. Therefore, we consider the diagram in Fig. 8.3. Here the initial 


* H. Frohlich, Adv. in Physics, 3, 325 (1954). 
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P 


Fig. 8.3 Electron with momentum (P — K) and phonon with momentum K. 


state has an electron of momentum P and no phonons. In the intermediate state 
the electron’s momentum is (P — K) and a phonon of momentum K exists. 
Thus the energies E° and E°® are 


P2 
Ee = > 5 
_ 2 
Eo = Poy +1. (8.8) 


Here ‘‘1” in E° is the energy of the phonon (in our present units). 
= I : 
Hyo = (V2 n0)*/2{n| » K (axe '*’* — axe'*’*)|P; no phonons), 
K 


which is 0 unless |”) represents a state with one phonon and an electron of 
momentum P’. If the phonon has momentum K, 


Hy» = i(/2 ma)'/*¢ P’, no phonons |(1/K)e~‘*'*|P, no phonons). (8.9) 
Since e~'*"*|P) = |P — K), P’ must be P — K, as expected, so 


Hyo = i(/2 na)!/2(1/K) 5p px: 
and 

- 1 1 
AE, = -V200 3p wD 1. PD x 2. (8.10) 
The extra factor of 2 is a consequence of the spin of the electron. There are two 
intermediate states with momentum P — K. Generalizing Eq. (8.10) to three 
dimensions and writing the sum as an integral, we find 


Qe NT RAK? = OPK & 2)" 


(8.11) 
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In evaluating this integral, we may make use of the identity 
ee (ee 
ab 9 [ax + b(1 — x)]?- 
Regarding b = K* anda = K? — 2P-K + 2, wecan write Eq. (8.11) as 


(8.12) 


AE, 


UE: @K [* dx eee 
eee (22)? Jo [x(K? — 2P-K + 2) + (1 — x)K?]? 


opt d*K|(2n)° 
~dnl2 a | = fz ~— 2xP- K + 2x) 


_ d?Kj{(2n)3 
—4ny/2 a | ia ha — xP)? + (2x — x?P?]?° 


Now we use the integral: 


[gic 2, ail 
[K? +a? 82a 


to find 
AE 4ny/2 |, si 
0 8n/2x — x?P? 
_4Any/2 2 ag 
8x P </2 
Thus, finally, 7 
AE; = —«a v2 sin™! ae (8.13) 
P J2 


AE; = —d&, (8.14) 
If the perturbation expansion is carried to higher order,* Eq. (8.14) becomes 
AE = —« + 1.26(«/10)?. 


This is the perturbation energy when the electron is at rest. When P is small, 
we may expand Eq. (8.13) in the form 


me HEH 


P?2 
ee ee ee eee 8.15 
ss (8.15) 


* FE, Haga Progr. Theoret. Physics (Japan) 11, 449 (1954). 
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When we combine this perturbation with the kinetic energy, the total energy if 
2 2 2 
P ae <a 
2 12 2(1 + «/6) 


Equation (8.16) shows that the mass of the electron is increased (1 + «/6) 
times by the interaction with the phonons. That is, 


(8.16) 


Melt as quote (8.17) 
m 6 

In Eq. (8.13), if P > 2, AE; becomes imaginary. This implies that the 
energy of the electron dissipates by creating a phonon in the same sense as in 
Cerenkov radiation. Let us evaluate the smallest value of P sufficient for this 
decay effect. Obviously, the process that could give the smallest possible P is 
the one in which P is in the direction of K. Consider the case shown in Fig. 8.4. 

By conservation of energy, we know that 


(P~ KY |, _ ?? 


= —_., 8.18 
: ; (8.18) 
Equation 8.18 can be rewritten as 
kK 1 
—+— =P. 8.19 
2 K or) 


The smallest P that can satisfy Eq. (8.19) corresponds to the minimum value 
of the left-hand side, which is /2, so that 


P> v2 (8.20) 


is the condition for the “Cerenkov” effect to occur, as we might have expected 

from Eq. (8.16). 
Let us next calculate the rate of this Cerenkov effect. This rate is given as 
Rate = >. 2n|H;-o|76(E; ram E;). (8.21) 


Final 
states 


The initial state i consists of an electron of momentum P, and in the final state 
we have an electron of momentum (P — K) and a phonon of momentum K. 
From Eq. (8.6), we have 


’ l ra 
Hy, = (P', 1xlAinlP> = K (/2 Ta)’ "Opi pax: 


Fig. 8.4 Electron dissipates energy by creating a phonon. 
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where 1, means one phonon of momentum K. Thus 
5 _ K\2 2 3 
Rate =! = 2m (V2 ((P- KY, _ PGK, gy 
T K? 2 2 } (2n)° 
The 6-function insures conservation of energy. We may write 


d°K = 4nK? dK(dQ/4z) 


so that 
ee 2 
Rate = 2/2 « {a [ax s(—PK cos 6 + = + t). (8.23) 
| T 


where 6 is the angle between P and K. 
When K, is a solution of 


K2 
—PK, cos 0 + a +1=0, (8.24) 


we may transform 


[axo(-PK cos 8 + x + ) = [ax 6[4((K — P cos 6)? — (Ky — P cos @)’)] 


\Kg — Pcos 6] 
Thus Eq. (8.23) becomes 
_ dQ 
=  , 8.24 
Rate = 4V/2 a me — P cos 0 ee 


When the momentum of the incoming electron P is given, K, of the created 
phonon is a function of the angle 0, and is given by Eq. (8.24). Solving Eq. (8.24) 
we obtain 


Pcos@=V2 : y 
Rate = 99 04 | __ sn de = 20 v2 cosh7 a . (8.25) 
0 JP? cos? 6 — 2 P /2 


It should be noticed that since PIV aes 1, we may write 


-1 P _2 


4 
sin + icosh~/ —. 


a 2 /2 


Thus, cosh~! P//2 in Eq. (8.25) is the imaginary part of sin~! P/V 2, a factor 
we obtained in Eq. (8.13). The connection between Eqs. (8.13) and (8.25) can 
be seen as follows. When the rate of transition is y, the amplitude of remaining 
in the original state has a factor e~”/*, and the time-dependent wave function 
has a factor 


e (1/2)to tet =_ e Kenney: 
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2 
|Hzo| 
—- 0 +0 
E,—E 9 


Fig. 8.5 The summand in Eq. (9.7) 


thus —y/2 is the imaginary part of the energy. Equation (8.25) agrees with 
Eq. (8.13) because both are for the limit of small « and both are based on a 
perturbation treatment. 

Although we have found Eq. (8.25) to be consistent with Eq. (8.13), you 


may object; Eq. (8.13) should be valid only when P < /2. The situation is 
clarified as follows. Strictly speaking, Eq. (8.7) does not hold when E® = E®, or 
when the sum over E° diverges near E3. Schematically drawn, the summand in 
Eq. (8.7) has the shape shown in Fig. 8.5. The correct way to take care of the 
region near EQ is to write the perturbation as 


H,,H 
AE = On*+# nO 8 7! 
dz SE? 4 ie on 
and take the limit as ¢ —> 0. 
We see 


1 x ig 


x+ie x7x*> +67 x? 4+ ¢? 


The first term has the shape shown in Fig. 8.6, and to use this term is equivalent 
to taking the principal value when integrating. The imaginary part approaches x 
times a 6-function as e — 0, because 


"7 g 
_o xX +6 
Thus we may write 


lim 
270 X + ié 


= principal value 8 — in (x). (8.26) 
x 
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f(x) 


Fig. 8.6 The functio Se 
g ion f(x) career: 


O 
Fig. 8.7. The integral | © 8 dean 
sie ee 
Thus Eq. (8.7') may be written as 
HH, 
AEg= Yo wm, — it Y lAypol” (Ey — Eo). (8.27) 
ee 1 Eo , E, Sf 
principa 


part 
Equation (8.27) shows the consistency of Eqs. (8.7) and (8.21). Although 
Eq. (8.13) was originally calculated for P < J 2 the analytic continuation was 
tacitly made by raising the pole slightly. 
Problem: For low-frequency sound waves 
wo = KC, 
C, being the velocity of sound. Calculate the critical value of P for radiation of 


sound. Calculate the direction of the emerging sound waves as a function of 
energy. Evaluate the rate of emission. 
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8.3 FORMULATION FOR THE VARIATIONAL TREATMENT 


The partition function may be written as 


e PF = Trfe ™*] = ye ™. (8.28) 
When f — oo, the leading term is e °¥°. Therefore, we claim 

lim & In (Tr “my = Ei (8.29) 

Bo 


Thus, we first calculate the partition function to evaluate E,;,. We are partic- 
ularly interested in the case of large P. 
Using the path-integral representation, the partition function is written as 


Tr (e 8) = [e° Q(path). (8.30) 


The path-integral representation is used when the Hamiltonian is written in 
terms of the coordinates and momenta. 

Since H is written in terms of the creation and annihilation operators we 
must first undo it and write it using coordinates and momenta. If we quantize 
the motion of a crystal, we must choose the creation and annihilation operators 
so that 

fe2= 1 
K J2 
L i+ 
Pe = —= (4-x — 4x). 
J2 


(ax + a_x) 


(8.31) 


But then the interaction between the electron and the phonons is of the form 


— 1 
Hix. = (2 na)'!/? y r7 [ate iK'x _ ayei® X] 
K 


ax 1 
i(/2 mx)*/? > iK\ (atx — axe'™'* 
K 


ae ae 1 
V2 (V2 na)? Ye pgell, 
K |K| 


We do not find the above form for H,,, suitable, because in path integrals we 
want potentials that are functions of position, rather than of momentum. 
We can exchange the roles of q, and p,x either before quantization of the crystal 
motion (by means of a canonical transformation) or afterward, as follows. 

Let ax = —ia_,. Then if we define g, and px, as in Eq. (8.31) we find 


4x = Pei DPe= —4k. 
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The resulting Hamiltonian (dropping the primes) is 


Se gik x 
H = 4P? + ¥ Gp2 + 4q2) + V2 (V2 am)? eo (8.32) 
K K 


Thus, the Hamiltonian is written in terms of the electron coordinate X, 
momentum P, phonon coordinates gx, and momenta px. We write 


Tr (e748) = e~> Dx(u) Dq,(u) Dq,(u):-- (8.33) 


X(0)=X(B) 
qi(0)=4i(B) 


where the action integral S is 


f= ‘les 4 y 4(42(u) i q2(u)) 4 J2 (/2 am)!/2 » qx(uye* = 4 


(8.34) 


The advantage of this method is that the path integral over the phonon 
coordinates can be performed because gy, and g, both appear quadratically 
(and linearly) in Eq. (8.34). We use the result proved previously*, and find 


B 
fox \- | 4G? oo) eaGenl aul Dq(u) 
(9) 


1 (8 fe e 
= exp 1a | | y(t)y(sje?!*~s! dt ash. (8.35) 
4@ Jo Jo 


Since y and g are complex here, we modify Eq. (8.35) to read y*(t)y(s) instead 
of y(t)y(s). We will need this modification after Eq. (8.36). Also, Eq. (8.35) is 
approximate, for e~°® is nce eal When we use Eq. (8.35), Eq. (8.33) is 
written as 


Tr (e7*") = io Qx(u) 


-| exp (- aa xq) iu) exp (23 an |. i. i 


x exp (iK: (X(t) — X(s))e7"~*! dt ds) DX(u) 


* Equation (3.39.) 
+ The complete form is written by replacing e~@!*—s! in Eq. (8.35) by 


e 
lek fee 


—q@|t—s| erlt—s| eB 


But remember that £ is very large. 
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or, changing >’, into | d°K/(2z)’, 


34 _\iK -(X(t)—X(s)) 
s=5 | X? du ~ J2an | [| On a 2 e ltsl dt ds. 
T 


So 
alee te a 8.36 
7 Al ° vel. |, |X(t) — X(s)| ae 4 


Actually, we are in error when we treat all the q,’s as independent real variables 
as we did in Eq. (8.35) above. The g,’s are complex, with qx = q_x. But if 
we went to the trouble of doing it properly, Eq. (8.36) would still come out as 
it did above. 

Our aim in using this path-integral approach is to combine it with a 
variational theorem described before. We write 


e-S Math) = Lee Apath) 
{ e-°° D(path) 


e~ 5° (path) 


= Ce S50) [es (path). (8.37) 


where S, is an appropriately chosen approximation of S. We use the inequality 
< et » > e <f> 


to write Eq. (8.37) as 


jes Q(path) > e 5°? Jes Q(path), (8.38) 

where 
(Sig oO Soe Fay 
J e~*° D(path) 


Because for the approximate choice of the Hamiltonian the free energy Fp is 
given by 


(8.39) 


ge PFo — je“ PR(path), 
we can write Eq. (8.38) as 
F< Fo + 48 — So». (8.40) 
Before choosing So, let us examine the meaning of S in Eq. (8.36). The 


first term is analogous to a kinetic energy, and the second term represents the 
potential energy. The special feature of this potential is that the potential at f, 
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where ¢ is thought of as a “‘time,”’ depends on the past, with weight e~ !*~!. This 
is a type of retarded potential, and the significance of the interaction with the 
past is that the perturbation caused by the moving electron takes “‘time’’ to 
propagate in the crystal. 

When « is small, we may take only the first term of Eq. (8.36) as Sy: 


So = t [3 *(u) du. 
Then, from, Eqs. (8.40) and (8.29), we obtain 
E<E,+ lim 7 <8 ~ So). 
Boo 


This gives the old perturbation answer 


8.4 THE VARIATIONAL TREATMENT 


We have seen that the use of Sy for the kinetic energy 4 {6 X?dt gives the per- 
turbation result AE < —« (second-order perturbation) and furthermore this 
result is shown to be an upper bound (a result proved only by much greater 
effort with the usual methods). 

However, it is possible to imitate the physical situation much better with a 
more judicious choice of Sy and thus obtain a much better estimate of E. The 
next most obvious thing to try for Sp, is the action for an electron bound in a 
classical potential V(X). This choice can be shown to be equivalent to the use of 
some trial wave function in the ordinary (Ritz) variational method. In particular, 
if one chooses V(X) to be a Coulomb potential, one obtains for E (at large «) 
the same result as that which follows from a trial wave function of the form 
e~*". If one chooses V(X) to be a harmonic potential, then one obtains an 
improved estimate for E that could also be obtained with a trial wave function 
of the form e~ *** (at least at large «). 

However, it can be shown that for « less than approximately 6, no V(X) can 
improve on the result V = 0! This indicates that a classical potential is not a 
very good representation of the physical situation, except possibly at very large 
binding energies. There are two major reasons for this. 

First, the electron is not constrained to any particular part of the crystal but 
is free to wander. Any potential V(X) obviously tends to keep the electron near 
its minimum. 

Second, we see from the form of the exact action that the potential the 
electron feels at any “‘time’’ depends on its position at previous times, with a 
weighting factor e '*~‘!, That is, the effect the electron has on the crystal 
propagates at a finite velocity and can make itself felt on the electron at a later 
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Electron , Particle 
<< 


Mass = 1 Force =-K¢ M 


Fig. 8.8 An electron coupled by a “spring” to another particle of mass M. 


time. This is less so for tighter binding, where the reaction of the crystal occurs 
much faster; hence a classical potential might well be expected to be a good 
approximation only for tight binding. 

A model that would not suffer from either of the above objections would 
be one where, instead of the electron’s being coupled to the lattice, it is coupled 
by some “‘spring” to another particle and the pair of particles are free to wander. 
See, for example, Fig. 8.8. The trial action for the system after the coordinates 
of the mass M have been eliminated is 


B. BOB 
So = | X? dt < | | [Xu — X@l2e7W"s! dt ds, (8.41) 
2 0 2 0 0 


where W = \K/M and C’ = MW?/4. 


Of course, we could have seen mathematically that Sy would overcome the 
objections raised above without deriving S, from any physical model, because 
we are free to choose any form for the trial action that we please (if it is not 
complex). However, the recognition of the physical nature of S greatly helps 
in avoiding mathematical difficulties when questions of boundary conditions, 


asymptotic behavior and so on, arise, and also may facilitate computation 
(see below). 


We write according to our variational principle 
1 


where E) is the binding energy of our model and ¢ ) means “‘average with weight 
e So.” 


Let 
I(K, t,8) = Ce 8-20) 
[ fX FOX) e— So DX 
Fe OX 
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If we can evaluate I(K, t, s), then it is easy to find (S — S)): 


eae AI, \, Spe a 


eo ({. |. {X(t) — X(s)}*e" VIF! dt is). 
2 \so Jo 
For example, 


BB BB 
(| | |X(t) — X(s)|*e~ S| dt is) = | | dt dse~ W""~s|(1X(t) — X(s)|?) 
o Jo o Jo 


BB 
-| | dt dse~ W!*-s\l _W21(K,t,s) ]x=o- 
0 JO 


In general, for any function f of |X(t) — X(s)|, we can evaluate <¢ f(|X(t) — 
X(s)|)> by Fourier analyzing f. 


1 ) Are!K IXO—X(S)1 3K d?K 4n 
a ee Oe Gee ee I(K, t, s). 
|X(t) — X(s)| K (27) (2n)’ K 


To evaluate /(K, t, s) we will work with the more general 


I(g, t,s) = (exp [ i g(u) - X(u) iu) : 
0 


and then will specialize to the case g(u) = K[d(u — t) — d(u — s)]. We may 
proceed in several ways to do this general path integral: 


a) If we divide “‘time’’ into small intervals, we have a problem of the form 
| | [oes dX, dX3°*:exp (-» A; ,X ;X ; + i )) B,X)). (8.43) 
ij i 


This integral has been developed* for A;; a positive definite and self- 
adjoint matrix. The result is that the expression 8.43 becomes: 


(n)"? (Det A)~"/? exp [-4 ¥ B,B(A7*),/]. (8.44) 


LJ 
Here, n is the number of time intervals in B, and it may tend to infinity after 
dividing by the proper normalization to make the quantity approach 1 as 
B; — 0. With this method we must insert the end-point conditions X, = X5 = 0 
by any one of several artifices (such as an added term in the driving force B). 
b) Alternatively we may proceed by recognizing from the physical model 
that S, can be broken up into “normal modes,” that is, a free particle with the 


total mass M + | plus a harmonic oscillator of frequency v = Jw? + 4C/W 


* B. Friedman, “Principles and Techniques of Applied Math,” Wiley, N.Y., 1964. 
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and reduced mass M/(1 + M). We have already done the path integrals for a 
driven free particle and a harmonic oscillator, and hence we may write the 
desired result immediately. 


c) Alternatively, we may proceed to perform the integral by a technique* 
(used in previous lectures for solving the simple harmonic oscillator) that 
consists of finding the path, X’(t), which makes the driven action an extremum 
and noticing that (within a normalizing factor) the desired result is 


1 (4 
I = exp > | g(t): X(t) ar]. 


We will use a fourth method. Let 
X(t) = X(0) + ¥ a, ia. 
| =1 
Note that X(t) satisfies the requirement that X(0) = X(f). Integrating over all 
paths is equivalent to integrating over all X(0) and all a,. Working in one 


dimension (for simplicity), we first must find Sp in terms of X(0) and a,. 


By2 00 2 
ac =, y 4 cost OE gy 1S G22 ™ (R45) 
0 oO 0 B 4 


c (Fé _ wie 
a | [x(t) — X(s)’e7 "5! dt ds 
o Jo 


ai |, > a sin TE ia = \lens dt ds 
p B 


1/8 2 
W pa tas + n? at on ce) 


Equation (8.46) holds only approximately because we have made certain 
approximations that are valid in the limit of high B. Then 


: 


B re) re) 
i g(u)X(u) du = X(0)by + » a,b, = », a,Dn» (8.47) 
0 n=1 n=1 
where 
B 
b, = i g(u) sin du = iK (sin — sin). 
; ; p 
And 


I(K, t, s) = IJ IE oo exp cap en (A,a? — a,b,) | da, da,:-- 
7 fff. exp — (2%, A,a?) da, daz: 


* R. P. Feynman, Phys. Rev. 97, 660, (1955). 


, (8.48) 
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where 


2,2 
eg mm (44 4C/W . 
4B W? + n*n?/p? 


From Eq. (8.48) it follows that 


00 b? 
mae (=, 4A, ) 


exp (> _—K "(sin (nat/B) — sin (nns/B)) _ 
n=1 n?77/B(L + (4C/W)/\(W? + n?n?/B?)) 


I(K, t, s) 


. (8.49) 


As B approaches oo, the sum in Eq. (8.49) becomes an integral with 
nr/B — x, n/B = (2/B) dn > dx. This integral can be evaluated to give 
2 (Ww? 4C 


K 
I(K, t, s) = exp | —— 4——~ |[t — s| + 
( ) P| a | WV? 


[1 — elt-slV 


4 e &+s)V —_ te 2" = ama , (8.50) 


where 
V? = W? + 4C/W. 


I(K, t, S) is used only in integrals over s and ¢ with large B. In such integrals 
there is a contribution only when K?W2/2V2|t — s| is not much greater than 
unity, that is, in an area of order 2BV?/K* W? (in the plane formed by the s and ¢ 
axes). In essentially all of the regions of integration, the fact that f is large 


implies that e~ “Tt”, e~?"*, and e~?"5 are very small, and we can write 


K? (W? 4C 
2 V2 wv? 


I(K, t, s) = exp es lt — s[| + [1 — eran, (8.51) 


From Eq. (8.51) we get for one dimension 


BB 
(| | (X(t) — X(s))?e" "8! dt is) 
o Jo 


BB d? 
— “ | dt dse~ Wlt-sl ae I(K, t, yy) 
; dK 


0 


2p 


~ —— as B > o. 
x=-0 VW 


For three dimensions we must triple the result (since V - K = 3 in three dimen- 
sions). We find that 


C ae ig _ 2,—-Wlt—s| es 3BC 
5 ( \ \ IX(t) — X(s)|2e ) a (8.52) 
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We can now find the E, we wanted in Eq. (8.42) as follows: e~ 4? = 
{ eS QX implies that, for large B, 


! , 1 d — BFo(C) 


= 7 “I | | dt ds|X(t) — K(orenme") 


3 3 
VW w/w? + 4c/w 
Integrating, and using the fact that E,(0) = 0, we have 
E,(C) = 3(V — W). (8.53) 


To get (S — S > we must evaluate 


She ee ra \atds= | OS ee | tase ge dt ds. 
X(t) — X(s) (2n)3 K? J, J, 


Let I(K, t, s)e~'!*~s! = g(|t — s|). Here we are assuming Eq. (8.51) to hold 
because f is large. Then it is easy to show that 


BB B 
{ { g(|t — s|) dt ds = 2 | (B — u)g(u) du. 


Since g(u) = 0 for large u, Bg(u) > ug(u) whenever there is a contribution to the 
integral; so we can neglect ug(u). We can also let the upper limit of the integral 
be oo. Then 


Uf (asa) 
|[X(t) — X(s)| 


_ ( d°K 2p(4n) (? _fK* Wu 4C uv 
= Jiaee ae |, [fae ts ] + 4] 


_ 4BV [” e" du 


ee ee DS eee Se (8.54) 
J2nJo JW2u + [(V? — W2)/V]1 — e-%”) 
Using Eqs. (8.52), (8.53), and (8.54), we arrive at the final result: 
E < Ey + 44S ~ %) 
- = due " 
—(V — Ww) — iF" 
Bo ar ) Va Jo [W7u + (V? — W*)IV)A — eW*")]1/? 


(8.55) 
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We may now vary V and W to obtain the lowest upper bound. (These results 
are the lowest ever obtained as upper bounds for this problem.) 


For small «. The best W = 3 and V = 3(1 + 2a(1 — P)/3W), where 


P=—[(d — W)'? — 1]. 


i 


From this, we obtain 
E < -—a — «7/81 = —a — 1.23(a/10)? (8.56) 
The correct result from perturbation theory to this order is 
E = —a — 1.26(a/10)?. 


For large a The best W = 1 and V = (4a7/9z) — (4(In 2 + C/2) — 1), 
C = Euler Mascheroni constant = 0.5772,..., and 


E < —a?/2n — 3(21In2 + C) — 2 + O(1/a?)::: (8.57) 


which, up to « = 10°, is lower than any other estimate, upper bound or 
otherwise. 


For intermediate « A very great advantage of the method here is that it 
gives a smooth result for E from weak through intermediate through strong 
coupling, and it gives the only known reliable results for this problem in the 
intermediate coupling region. Unfortunately, a little machine calculation is 
required here to perform the integration in 1/B¢S). In case you would like to 
compare some other calculation, we give in Table 8.1 a few results obtained by 
~ Schultz* using Eq. (8.55) and a computer. 


Table 8.1 
i 3 5 7 9 11 


Upper bound of E: — 3.1333 — 5.4401 — 8.1127 — 11.986 — 15.710 


When using an approximate method, we like to have some idea of the size 
of the error. Our approximation was to assume that the inequality 


represents an equality. To check the error involved in this assumption, proceed 
as follows: 


<e") 


2! 
oD + UCS2) = CP) 


* T. D. Schultz, ‘“‘Electron-Lattice Interactions in Polar Crystals,” Thesis, MIT, 1956. 


(ase Fabel +> 
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For small f, a good approximation is to neglect all but the e ‘J? term on the 
right. But a better approximation (for small f) is to take 


feS) = exp[—<f> + Hf? — <f>*)] + higher-order terms. 


Taking e fF Fo) = (e~ S~ 50. we get 
1 1 
Fw Fo + 3 — Soo - 2B Ki Sol7>o iS = Soo] 


+ higher-order terms in (S — So). 


If S — S, is not small, the correction term, 


! 2, _ ve _ ¢ 
ay [<IS — Sol?) — <S — So>*], 


may not be an improvement, but it should be an order-of-magnitude estimate 
of the size of the error in the uncorrected estimate, F ~ Fy + (1/B)<S — So». 
Such an order-of-magnitude estimate of the error in the polaron energy has 
been carried out, and it turns out to be small. 


8.5 EFFECTIVE MASS 


A variational principle along the above lines has not been found for the calcula- 
tion of the effective mass of the polaron because we need paths e~* where S’ 
is complex. In fact, for the lowest energy of an electron with velocity U we 
need to path integrate using 


S'(X) = 5/8 dt — 2a | |X — X, + 2iU-(t — s)|~te"!*~5! dt ds 


= S(X, + iUt), 


as we would expect in “imaginary-time’’ space. However, if U were imaginary, 
then we would again have a real S’ and could use the variational method. Now 
one can show that E(|U|), considered as function of a complex variable, is 
analytic near |U| = 0. Hence Fin the neighborhood of |U| = Ocan be obtained 
by knowing E for imaginary |U|. Therefore, one can deduce that for small 
velocities the effective mass is given by 


0 t 


7 ee 
m=1+ pe Nay? | ure (8.58) 


o [Wr + (VV? — W/V) — eo )P?° 


where V and W are the best parameters found for E(U = 0). It is interesting to 
note that Eq. (8.58) gives a value for m that is always within a few percent of the 
total mass, 1 + M = V?/W7?, of the trial model.* 


* For more details, see R. P. Feynman, Phys. Rev. 97, 660 (1955). 


CHAPTER 9 


ELECTRON GAS IN A METAL 


9.1 INTRODUCTION: THE STATE FUNCTION o 


The actual behavior of a metal is very complicated: The metal electrons interact 
with the lattice, with the lattice vibrations, and with one another. As a first 
approximation, we will assume the lattice to be rigid and will neglect the mutual 
electrostatic and magnetic interactions, except insofar as each electron is 
affected by some average potential due to the other electrons and the periodic 
lattice. 

Hence, the Hamiltonian is given by 


h*V2 
H=h, +h, +°°:; h, = — —* + V(R,) (9.1) 
2m 
and the energies given by 
hyu; = €;U;, ES » Nn;é;, and > n; = N. (9.2) 


i i 
From the Pauli exclusion principle, n; = 0 or 1, and the wave function of 
the entire electron gas must be antisymmetrical. At moderate temperatures, the 
electron gas is almost in its ground state. That is, the N electrons occupy the 
lowest N states—almost. For a gas with all spins in one direction (for example, 
all up or all down), the ground-state wave function is 


1 
Qt Sp 
JN! 
>p Means a sum over all permutations, an even permutation having weight | 
and an odd permutation, weight —1. PR,, which might better have been written 
Rp;, Means a permutation of the electrons. The normalization factor | If N! 


will be derived later. 
With these definitions of )\p and PR,, it is seen that Eq. (9.3) can be 
written as 


> (= 1)"u,(PR,)u2(PR>2) a (9.3) 


g = = x Det [u{R))] = 


vN! 


1 u,(R,) u2(R,) 
—— |u,(R2) u(R,) °°). (9.4) 


JN! 
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Suppose that V(R) = 0. Then (reviewing results obtained in Chapter 1), we 
have 


272 iK-R 
h=-— iad and u(R) = ee (9.5) 
2m = V 
The energy is* 
h? K? h?K? d?K 
E= ee) peeeeuiete —— VF(K). 9.6 
» » i | 2m (2n)° ) oe) 


V = volume and F(K) = 1 if |K| < Ko, F(K) = Oif |K| > Ko, where hKo is 
the maximum momentum, that is, Ey = A*K%/2m = Fermi level. 


Ke v2¢2 2 2 
E=V| h?K* 4nK Kayil 1 gs 


~ "| om Qne = 5 2m2n? 
Ko 2 (9.7) 
voy ane) <a | 
9 (2z)° (27) 
Thus, if p97 = N/V ande = E/V, 
3 mWwK2 1 ae 
e= = — 6n7)?/3p2/3 = ape’. 9.8 
ae oe ne. "po Po (9.8) 
It is sometimes convenient to define rp and r, so that 
1/p>o = volume per electron = 47/3 r@. 
r= r/Qo 
where 
dy = h*/Zme? = Bohr Radius. (9.9) 


r, 1s of course dimensionless, and for most metals, r, is between 2 and 6. The 
energy is often expressed in Rydbergs where 1 Rydberg = me*/2h7. 

The results of Eqs. (9.7) and (9.8), plus, of course, many more, can be ob- 
tained from ¢ of Eq. (9.3). First the normalization factor appearing in Eq. (9.3) 
will be derived.t We require { p*p d°R, d°R,-:-d°R, = 1. Consider a 
specific term in the summation of Eq. (9.3), without any normalization factor. 


Such a term is, for example, u,(R,)u.(R,)::-. Because the uv; are orthonormal, 
ut (R,)uz(R2) ** + uy(R)u2(Ry) °°: d°R, d°R, rss = 0. 
Also, 
| wi@nute) + u,(Ry (Ry) “+ PR, d? Ryo = 1, 


* We assume periodic boundary conditions. 


+ The factor 1/N! did not appear in Eq. (6.125) because th enormalization there was 
1/N! {o*g d?R,:-:d°?R, = 1. 
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In other words, 
| ote Rayus(R a d?R, d°R, ee L. etc. 


There are N! terms of the type u,(R,)u2(R2):-: and thus { o*o d°*R = N! 
for an unnormalized g. When u,(R) is normalized, the normalization factor is 
therefore 1 Jr/ N}. 

The kinetic energy T can be calculated in the same manner as the 
normalization integral. 


h?V? h°v? 
T =[o0,) - — = — wy od RR, 9.10 
(9.5 mo) [ore (9.10) 
Consider a single term of @, which is now assumed to be normalized. Such a 
term 1s /VN! u,(R,)u,(R,):-: and (dropping the i/VN)), 
—hV4/2mu,(R2)u2(Ry)*** = uy(R2)u3(R3) ‘(— h’Vi/2mu,(R;)). 
Although it is not a priori evident that 
(—#?V{/2m)uz(R,) = A,u2(R,), 


where J, is some eigenvalue, this must be so since electron number one has 
nowhere except uw, to go. That is, V7 operates only on u,(R,) and does not 
bother other electrons. The argument used to evaluate the normalization 
integral now may be used, and since 


Since V = 0,* the kinetic energy T equals the total energy E and the uw; are 
plane waves. Hence A, = h*K?/2m and 

2ye2 2KV2 3 m2 2 

E_yitK* _ h'K" pep) 2K h*K > 


PO es Lea EE a8 
Vo 2m 2m 1) Op ae 


(9.11) 


Equation (9.11) is the same as Eq. (9.8). 
The energy per electron = 3(h7/2m)K¢ = 2.22/r2 rydbergs, where Eq. (9.9) 
has been used. 


9.2 SOUND WAVES 


It is interesting to ask what the velocity of sound cg is in an electron gas. It 
can be found by introducing a perturbation of density p = p,,, + (dp) cos kx 
and determining how it propagates, or more directly by calculating cZ = dP/dp 


* Try not to confuse V = volume with V = potential. 
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where P is given in Eq. (1.53) and p = the mass density (not the number density 
Po). Then 


ie EE sss SUNOS oe (9.12) 

6p = J/3m 3m 

The velocity of an excitation 
eee eee Ve. (9.13) 
h 0Ko m m 

where P, is the Fermi momentum (not the pressure). 
Equation 9.12 is a general result; if the velocity of a gas particle is a constant, 
C, the velocity of sound is Chl 3. For, suppose P is the pressure of the gas, 
e its internal energy density and p its internal momentum density. Then, to 
first order in the sound perturbation, we have, using standard hydrodynamics, 


pe ave 
é = —C*V:p (energy conservation). 


But for constant velocity we also have P = 3. Combining these equations we 


obtain 
2 


¢—~ ve = 0, 
3 
which represents a wave propagating with velocity C/V3. 

The perhaps startling conclusion may be drawn that sound propagation in 
the collisionless electron gas is impossible. In other words, the energy of a den- 
sity disturbance will be dissipated very quickly. This may be seen as follows: 
Consider a Fermi sphere (Fig. 9.1). 


Fermi surface 


Fig. 9.1 A Fermi sphere. 
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A given excitation of momentum K can increase the total energy of the gas 
by as small an amount as we please (for instance when an electron is excited 
from A to B tangentially to the Fermi surface). Thus a great many energy levels 
of excitation exist for a given momentum increase. These energy levels are 
below the energy of a sound wave and hence the sound-wave energy will be 
quickly dissipated. 

9.3 CALCULATION OF P(R) 
We now remove the restriction V = 0. Let 
V = VV (R, R)). 
ij 


The reader can easily verify that 
(» TVR, R je) =) | [uf(R,)ui(Ra) — uP(R,)U*(R,)] 
ij ij 


x V(R,, R,)[u({R,)u(R2)] d°R, d°R. 
(9.14) 


What is the probability, P(R), of two electrons (of same spin) being a given 
distance R apart? 


° 


tJ 


P(a, b) = (» YAR, — a) 4(R, - 0) 


is the probability that there is a particular at a and one at b. This is of the same 
form as Eq. (9.14), so 


P(a, b) = ¥) {lu(a@)|*u O° — [ur (@u,(@)][uju(d)]}. 9.14) 


Take V = 0. Then u, ~ e*'4/\/V, and ; > { d?KV/(2z)%, so that 


P(a, b) = |J"ear" Zz ae — ei(K-L)-(a-by] a 


elk: (a—b) F(K) d°K |? 
N (2n) 


(9.15) 


3 


P(R) = | d3a d?bP(a, b) (a — b — R) 


2 2 
aes {1 - : (sin KpR — Ky cos KoR) | \. (9.16) 
V K3R° 


Equation 9.16 is an important result. For example, it tells us that the hard-sphere 
model of a one-spin gas should be easy to formulate, since under most conditions 
the electrons are not very close anyway (P(R) = Oat R = 0). The short-range 
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P(R)/R2 


Fig. 9.2 Probability P(R) of two electrons (of same spin) being some distance R apart. 


Coulomb interactions are now understandably not important—although the 
long-range ones are. It is also clear that Eq. (9.14) could have been written as 


¢ VCR; R je) = |r b)P(a, b) d?a d°b. (9.17) 


The mean potential energy is the integral over a and b of the product of the 
potential energy between two particles at a and b and the probability of two 
particles being at a and Db. 

Suppose that one wants to know the number x of electrons that are in some 
region R. 


= ) R R;) = x | sce — a)R(a) d°a = | eConca da 
where 


R(a) = 1 nee R 
0 outside R, 
and | 


pla) = ¥ d(R; — a) 
is the density of electrons at point a. 
Fluctuations can also be handled. For example, using the definition of 


P(a, b), we have 


(9, 19) = (o. ¥ | 6(R, — a) (R, — 5) R@)R(6) da bo) 
zs | R(a)R(b)P(a, b) d3a db. (9.18) 


Px = | e(@er da 


248 Electron gas in a metal 


or 
x= yee B = YOR; — ale * da. (9.19) 


Clearly (px) = 0 for K # 0. But by squaring Eq. (9.19), we derive 
(pl) = ( loxl?@) = | eK P(q, b) d?ad*b = VS(K), 


where S(K) = f e“**P(R, 0) d*R = Fourier transform of P(R,0). From 
Eq. (9.15) we see 


aL 
(21)? ” 


1 if |K—L| < Ko, 
0 if |K—L|> K,, 


S(K) = — al F(L)F(K — L) K #0. (9.20) 


Recalling that 
F(K — L) = } 


we can show that S(K) is the shaded volume in Figure 9.3. Calculating this 
volume, we obtain 


3 3 
se eee for K<2Ky, K<#0, 
AnK3 4K, 16\K, 
(9.21) 


S(K) = 0, K > 2Ko. 


Problem: Calculate <|p,|*)> for sound waves. Are the fluctuations here 
greater or less than the fluctuations of the real electron gas? 


9.4 CORRELATION ENERGY 


Consider an electron gas with the background of uniform positive charge. The 
determinantal wave function ¢ for electrons (Eq. (9.4)) leads to the exclusion- 
volume radial distribution function P(R) plotted in Fig. 9.2. The Coulomb 
energy is then calculated as 


e _ e” bi a 4ne” 
(°.5 °) [qrmare Fz sk) sr O28) 


ij rij 


Fig. 9.3. Intersection of two spheres, radius Ky, centers a distance K apart. 
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where S(K) is the Fourier transform of P(R). Using the form of S(K) 
in Eq. (9.21), we calculate Eq. (9.22) as 


2 
(» > oa °) is ee rydbergs. (9.23) 
on ; 


s 


Here r, is defined in Eq. (9.9) as r, = ro/d) with ay = the Bohr radius and 
(4x/3)ré = the volume per electron. 
The total energy of this system is usually written as 


E = 2.22/r? — 0.916/r, + ¢, rydbergs. (9.24) 


The first term is the kinetic energy calculated in Section 9.1. «, is called the 
correlation energy, but actually it is the difference between the true energy E 
and the first two terms on the right-hand side of Eq. (9.24). &, is also sometimes 
called the stupidity energy E,. 

There are attempts to compute ¢,. When the electron density is low and r, 
is large, we see from Eq. (9.24) that the kinetic energy is negligible compared to 
the potential-energy part. When we distribute the electrons in a regular array, 
the body-centered cubic structure gives the lowest potential energy, which is 


Ee, = — tel rydbergs for large r,. (9.25) 
r 


KY 


Comparing Eqs. (9.24) and (9.25) we see 
— 0.88 
r, 


for large r,. (9.26) 


&. = 


Wigner examined the case of small r, also, and his best guess is* 


— 0.88 

E. 4 78 rydbergs. (9.27) 

When Eq. (9.26) was calculated, electrons were put on a body-centered 
cubic array with the wave function taken to be a 6-function. We may get a 
better estimate of energy if we use a different wave function. A calculation was 
done assigning a Gaussian distribution for an electron around a point of the 
body-centered cubic lattice and varying the shape of the Gaussian distribution. 
A reasonable numerical result was obtained. 


9.5 PLASMA OSCILLATION 


In a conductor, electrical resistance increases when impurity ions are introduced, 
the extra resistivity being due to the scattering of electrons by the extra positive 
charge. However, the resistivity calculated from the scattering is much larger 


* See Wigner, E. P., Phys. Rev. 46, 1002 (1934); Trans. Far. Soc. 34, 678 (1938). 
Also see, for correction, footnote in Pines, D., Solid State Physics, 1, 375 (1955). 
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than the observed value. This indicates that the extra positive charge is partly 
neutralized by the higher density of electrons gathered around the positive 
charge. In the Fermi-Thomas model, for example, the variation of the electron 
density is related to the (slowly varying) potential @ by 

On 9 ORE 20 


n 2E, 2Ep 


where E;, is the Fermi energy. Thus 


2 
div D = div E — 4ne6n = ha Ze |e. 


so that the dielectric constant is 


2 2 
e(k) = ca Orne ee 
Obviously, this means that the field of a point charge decays to zero over a 
distance of (E,/6mne”)'/* (the Fermi-Thomas wavelength), and so does 6n in the 
neighborhood of the point charge. The point charge is therefore screened (if 
positive) by the electrons, which form a small cluster around it. 

The above applies also to each one of the individual electrons in the con- 
ductor, on being now negative, so that each electron creates to himself a “‘hole”’ 
in which it moves. This is beside the effect of Fig. 9.2. The same kind of 
fluctuation of electron density causes the plasma oscillation. It can be calculated 
as follows: 


Let us consider a one-dimensional sinusoidal fluctuation in the (number) 
density of electrons: 
P = Pay + (dp) cos kx. (9.28) 


The potential energy arising from the change of local density is written as 


E.= 1 [oe (5p)* cos* kx dx, (9.29) 
2 | dp” 


where ¢ is the energy of the Fermi sphere, which depends on the density. From 
the theory of sound we know that the velocity of a sound wave cy is written as 
d*2/0p* = (m/pjy)Co: (9.30) 


where m is the mass of an electron. For the plasma of charged particles, we have 
another contribution to the potential energy, namely the electrostatic potential. 
It is calculated from the Poisson equation 


—V?V = —4ne(6p) cos kx, (9.31) 
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whose solution is 


= 4ne(5p) Leg (9.32) 


The electrostatic potential energy is then 


25 | (—ep) d(vol) 


_il ane 
2 


Adding Eqs. (9.29) and (9.33), we find the total potential energy to be 


cos” kx dx. (9.33) 


E, + E,= ies ce — -) 69) cos” kx dx. (9.34) 


This quantity is equal to the kinetic energy of the wave in Eq. (9.28): 
Kaila o MO (6p)? cos? kx dx, (9.35) 


where w is the angular frequency in the wave: 
b= Ae Orn k =). 
Equating Eqs. (9.34) and (9.35) we may write 
c? = c} + 4ne?n,/mk?, (9.36) 


where we have put 
Cy = O,/X, 
and Ng is used for p,,. For the second term of Eq. (9.36) we introduce wp, the 
plasma frequency, as 
Wp = (4ne7n/m)''?. 


Then Eq. (9.36) may be written as 
= Jw + c2k?. (9.37) 


The minimum energy ee for exciting the plasma oscillation is fa,, 
which is about 10 to 15 eV. It has been observed by sending a beam of electrons 
through thin films of aluminum or beryllium. 

One might wonder how this semi-classical treatment fits into the quantum 
mechanical picture of the degenerate electron gas. To see this we note that the 
simplest way of creating an excitation of wave vector g from the Fermi sea 
is by the operator aja, with a certain allowed k, such that k + q is outside 
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the Fermi sphere whereas k is inside. Since we see no reason to restrict ourselves 
to a special k, we form 

O a Y f(Wah+ gtk: 
The average energy of the state O|0) is 


E= d (go + Ok, DISC)? + dX, (Vig) — Vik — kB) SP OSE) 


7 2 
(“k, @) = *-# 4 ©) 


and let us try to minimize it with the normalization condition 


y If? = 1. 


allowed 
k 


Obviously, this amounts to a diagonalization of a matrix A (allowed k’s only), 
A, = ( —A+ Ok, q)) Onn + Vg) — VK — k’). 


For large q, 0(k, q) seems to control the result so that the favorable excitations are 
electron-hole pairs, f(k) = 6, But when gq is small enough, the long range of 
the Coulomb potential suggests that V(qg) controls the result, so that the min- 
imizing f(k) is f(k) ~ const. Then the favorable O is 


O i >, Ah + gs 
k 


which is exactly the operator for a density fluctuation with wave number gq. 
We therefore conclude, by this qualitative argument, that for small q, 
density fluctuations should play an important role. 
Let us now see how we can make the microscopic picture more exact. 


9.66 RANDOM PHASE APPROXIMATION 


Bohm and Pines discussed the plasma oscillation in the following way. The 
Hamiltonian is 


H=)Yy Vi + = —. | (9.38) 


— fi 
| 2m 2 i#i Ni; 


The potential energy is Fourier transformed and written as 


— fh 1 Ane? alee. sees: 
Ha Voy 4 ty ey ek Pee RN]. (9,39 
2 Om w YK? b > ae 


(Subtracting N eliminates the self-energy term with i = /.) 
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We will proceed to write the equation of motion of density fluctuations. 
The Fourier components of the density operator 


p(x) = 5 a(R; — 2) (9.40) 


are 
PK = 3 eK Ri (9.41) 


Let us examine the Heisenberg equation of motion of px. 
The equation of motion of px is calculated as follows 


K-(P, + K/2) 


(9.42) 
a : 


Px = i(Hpx — pxH) = i ye’*™ 


where P, is the momentum operator of the /th electron. In deriving Eq. (9.42), 
we used the fact that px of Eq. (9.41) commutes with the potential energy of 
Eq. (9.39), and also we used the relation 


Pek ® — gk Rp = Kei ® (9.43) 


(remember that in our units # = 1). In evaluating px, we note that Eq. (9.42) 
no longer commutes with the potential part of Eq. (9.39), so that we obtain 


7 ik-r, LK « (P; + K/2)]° 1 4ne* ix. —ik’-Ry,iK’-R, j 
be =~ De® i —~ —~ ye le le IK: K’. 
i m V Pe mK 
(9.44) 


Now we make a crude approximation: 


ner, (K°(P, + KID. K?p2 em er Kp? 
penn Ee ta e3 PM Rey wm — XP, (9.45) 


where p, is the momentum at the Fermi surface. The “3” in the denominator 
comes from averaging over three directions. 

For the second term of Eq. (9.44), we separate the K’ = K term from 
K’ # K. The former gives 


2 Z 
4ne“ N ye Rs — A4ne“n be (9.46) 
m VWF 


where n = N/V is the number of electrons per unit volume. For K # K’, 


we may write 
2 


K- | a y ei(K—K')> Ri (9.47) 
l 


Bohm and Pines showed that this term is small for the relatively high-density 
case. When we neglect this term we call the result the random phase approxima- 
tion. This approximation is interpreted physically as the fact that the sum >, is 
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small if R, are distributed over a wide variety of positions, so that the various 

components making up the term tend to cancel. It should be emphasized that 

Bohm and Pines proved this approximation rather than simply assuming it. 
After these considerations, Eq. (9.44) simplifies to 


Px = —ORPK (9.48) 
with 
OK = p7K?/3m? + 4ne?n/m. (9.49) 
The last term gives the plasma frequency 


w, = 4ne?n/m. (9.50) 


9.7 VARIATIONAL APPROACH 


The plasma state can be investigated using a variational method. It can be 
regarded as the state of free electrons modified by plasma modes. Thus, we 
may use as a trial function 


PtTrial = ©XP | - » ala | P Frees (9.51) 
where px is the Fourier component of the density operator 
Px = en’, 


In Eq. (9.51) a, is the variational parameter. When we minimize the energy we 
find that Bohm and Pines’s treatment corresponds to the following choice of «,: 


2 
zal 2 for K< K critical 
— JhwpK (9.52) 


ok = 
0 for K > K critical 


Thus, the ground state for the plasma oscillation is 


2me 
@ = exp Fs > | Y Free: (9.53) 


Variational calculations done on functions of the above type* give correlation 
energies which were slightly larger for small r,, but coincide with Wigner’s 
values for r, = 4. These correlation energy values are also larger than values 
obtained by modifications of the RPA theory for larger (and more realistic) 
densities. t 


* T. Gaskell, Proc. Phys. Soc. (London), 72, 685 (1958). 
+ J. Hubbard, Proc. Roy. Soc. (London), A243, 336 (1957); P. Noziéres and D. Pines 
Phys. Rev. 111, 442 (1958). 
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9.8 CORRELATION ENERGY AND FEYNMAN DIAGRAMS 


We will be concerned next with finding the correlation energy of an electron gas. 
Although there is an easier way to do this (to be shown later) the present method, 
based on a paper by Brueckner and Gell-Mann* has the advantage of exempli- 
fying the Feynman-diagram procedure. The diagram method has proved very 
valuable and has wide application in quantum electrodynamics. Indeed it was 
for this latter field that the diagrams were invented (see Chapter 6, Section 12). 

The problem can be stated as follows: In a metal electron gas, the electrons 
interact via the Coulomb potential. The Hamiltonian is given by 


h? ee: eae 
H=-—YVW+ = -. (9.54) 
2m i 2 iF 5 Nij 

We wish to find the correlation energy AE due to the Coulomb interaction. 
Equation (9.54) can be converted into creation and annihilation operator 
language. Let a? be an electron-creation operator, that is, ap creates an electron 
of momentum P = AK with wave function e'*’*. Then ap annihilates an elec- 
tron, and as usual apap + apap: = Opp. With the methods of Chapter 6, 
Section 8, Eq. (9.54) can now be written as 


P? 2Qne*h? . 1 
H = ¥ — aja, t+ es at _part .odp.a 
ise PP V Looped P,—-Q“P2+Q“P2“P; 


= H, + Hi, (9.55) 


where the Q-dependence may be obtained formally by Fourier analyzing 
Eq. (9.54). However, Q is nothing more than the momentum transferred by a 
Coulomb interaction; 4ne?/Q? is the amplitude of such a momentum transfer. 
Several simplifications are possible. Chief among these is the introduction of 
b and b*, the annihilation and creation operators of holes. At T = 0, all states 
less than the Fermi level ¢) = P3/2m are filled. We then define b* and b as 
follows: 


ap = Ap if |P| = Po 
ap — ge if |P| < Po 
at =at if |PI> P, (9.56) 


at =b_p if |P| < Po. 


In other words, if an electron below the Fermi level jumps above the Fermi level 
we say that a hole and an electron have been created, and so on. We also have 


* Some references related to the correlation energy are: D. Pines, Solid State Physics 
I, 367 (1955). D. Bohm and D. Pines, Phys. Rev. 92, 609 (1953). M. Gell-Mann and 
K. A. Brueckner, Phys. Rev. 106, 2, 364 (1957). D. Pines and P, Noziéres, The Theory of 
Quantum Liquids, W. A. Benjamin, Inc., Menlo Park, California, 1966. 
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Po = h/ary, where V = N(4n/3)r3 defines ro (N is the number of electrons, V is 
volume of the gas) and « = (4/9z)'/*. The momentum, P, will be expressed in 
Fermi momentum units (Py) = 1) and the energy in rydbergs. 

From ordinary perturbation theory 


AE = <0|H ,|0» ao > (A on A 1) no 4. (Fy )om(A 1) mn no Anas 
n#0 Ey — E, n#0 m#0 (Ey — E,, (Eq — E,) 
(9.57) 
Equation (9.57) may be written as 
AE “equals” (0|H,|0) + <0|H, ——-——H,|0) 
Ey — Ho 
1 1 
+ <O|H, ———— H, ——__-H,|0) + ---. (9.58) 


Fo — Ho = Eo — Ho 


The quotation marks around “equals” in Eq. (9.58) are to indicate that the 
equation is valid only if a comment is added to it. This comment will turn out to 
be surprisingly simple when stated in Feynman-diagram language. We will come 
to this shortly. The notation of Eqs. (9.57) and (9.58) is familiar. That is, 
(0|H,|0> = ) (oH, o dx and 


(Ay) yn = | othe. dx, etc., where HoQ,, oF E,Pn- (9.59) 


Before continuing, let us try to see the “‘sense”’ of Eq. (9.58). First, it is easy to 
demonstrate the (near) equivalence of Eqs. (9.57) and (9.58). By the ordinary 
rules of matrix multiplication, 


1 1 
O|H, ————-H,|0) = | H, ————H 
<0| ‘Fr —H 110 (4, Ho 


0 0 


= X (Ay )om (sa), = (9.60) 


But, excluding pathological cases, 


| ess o)~, ax = f(E,) | ete. dx = Smnnf(En)- 


ea fete 
Eo — Ho mn "Ey — Ey 


and Eq. (9.60) equals 
>. (H von 1)no ; 


n Ey — E, 


Hence 
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The comment to be added to Eq. (9.58) is that when we multiply out all matrices. 
we exclude terms with n = 0. One could express this more precisely by writing 
(1 — |0><O})/(Ep — Ho) instead of 1/(E5 — Ho). 

Second, it is possible to understand more directly why Eq. (9.58) represents a 
perturbation expansion. This will not be demonstrated here except to say that 
the key to the demonstration is the matrix identity convergence* 


1 1 


= B 
A+B A 


= u es et, (9.61) 
A 

We return now to the mainstream of our problem. Using Eqs. (9.55) and 
(9.56), we can illustrate the possible fundamental processes allowed by H, 
by the set of diagrams in Fig. 9.4. A downward arrow represents a hole and an 
upward arrow an electron. The space below a dotted line stands for ‘before 
the interaction” and above, ‘“‘after the interaction.” 

Some comments on Fig. 9.4 may prove helpful. Process (a) is the only one 
that can start from vacuum or the ground state. The interpretation of (a) is 
that an electron and hole are created, the difference in momentum Q is carried 
along the dotted line, and another electron-hole pair created, the momenta 
being such that the total momentum is conserved. In general, each diagram 
junction represents a two-particle interaction with conservation of momentum. 
As another example, (c) describes the annihilation of one electron and the 
creation of two electrons (of different momenta) and a hole. 

To demonstrate how Fig. 9.4 can be used to calculate the desired matrix 
elements in Eq. (9.58), we first focus on the second-order term, 


eee 28%), = (2) 
Ey — Ho 


ie 


(OH, 


The first operator, H,, operates on the ground-state wave function and thus the 
first operation must be (a) in Fig. 9.4. 

The next H, operation must get us back to the ground state, because 
{O|A|O> = Ounless A~y = ago. (The (Ey — H,)~* operation does not change 
the state.) All the possible operations are given in Fig. 9.4, and it is quickly 


* Equation (9.61) can be proved as follows. Write 


en oo eee oe ae 
A A A A A A 
Multiply this with A from the left: 
AFQ ESBS RS Bo oe 1 — BS, 
A A A 
(A + B)S = 1. 
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Fig. 9.5 The operations of figure 9.4a and b added together. 


seen that only (b) added onto (a) can return us to the ground state or to vacuum 
conditions. See Fig. 9.5. 

Before discussing the amplitudes or numbers associated with each diagram 
of Fig. 9.5, let us pause to look at some third and fourth-order diagrams— 
Figs. 9.6 and 9.7. Figures 9.6(a) and (b) are both composed of the fundamental 
diagrams shown in Figs. 9.4(a), (b), and (e). 
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Pit+q 


Fig. 9.8 The first diagram of figure 9.5. 


When we reach fourth order a new ingredient appears. The diagram in 
Fig. 9.7 illustrates an evidently valid process. Note, however, that the diagram 
in this case consists of two independent parts—or unlinked clusters. It can be 
shown that unlinked clusters must be forgotten and not counted (because they 
involve intermediate states |n) = |0> in the expansion of Eq. (9.58)). In other 
words, the process illustrated in Fig. 9.7 is not valid and will not occur. The 
statement that unlinked cluster diagrams must not be counted is the afore- 
mentioned comment that must be added to Eq. (9.58). With the addition of this 
statement the quotation marks can be removed from that equation. 

Returning to the calculation of e‘*? we ask, what is the number associated 
with each diagram of Fig. 9.5? First, all fundamental diagrams (Fig. 9.4) have 
amplitude 2e*h?/VQ* where Q is the Coulomb interaction momentum 
(see Eq. (9.55)). Now consider the first diagram of Fig. 9.5, shown again in 
Fig. 9.8. gq, P,, and P, are arbitrary but then all other quantities are fixed. 
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1 
é?) = (O/H, ——— H, |0 9.62 
(Ol = HID) (9.62) 
from Eq. (9.58). From Eq. (9.62), we see that the amplitude for a given q must be 
24,2\2 
2neh’\’ 1 1 1 (9.63) 
vol q* q? E, — H, 


where |n) is the intermediate state in Fig. 9.8. Now 


EE =ito_ Pi, eo _ Pe 
2m 2m 2m 2m 


== (q? + 4-(P, — Pr)). (9.64) 


Note that the energy associated with a hole is negative since we are measuring 
energy from the Fermi level. 1/(E, — Ho) is called the propagator, and 
—m/|[q? + (P, — P2)-q] is called the propagator factor. (2ne7h?/vol)? 
(1/q*)(1/q7) is the coupling. Thus for a given P,, P,, q the amplitude associated 


with Fig. 9.8 is 
272\2 a 
a= (=O) ee (9.65) 
V q’q?q’? + (Pi — P.)-4q 


To find the total amplitude A; (associated with the diagram of Fig. 9.8) we 
must sum over P,, P,, and q, thus: 


P = 21 1 —m d*q_,, dP, d°P, 
‘i q? q? q? + (P, — Ps):q(2nh)>  (2nh)> (2h)? 


(9.66) 


The integral is such that if R stands for the Fermi sphere, then P, is in R, 
P, in R, and P, + qgand P, + gare outside R. Summing over electron spins 
gives us a factor of 4, two for each loop. 

Consider next the second diagram of Fig. 9.5, shown again as Fig. 9.9. 


Fig. 9.9 The second diagram of figure 9.5. 


9.8 Correlation energy and Feynman diagrams 261 


An examination of Fig. 9.9. enables us to write down the amplitude 
immediately: 


2ne7h? 2ne*h? 1 —m 
eee » (| (% rie al V Seer a ee or 


y3 d°q d°P, d°P, 
(27h)? 


f(P)F(PILL — f(Pi + q)\[1 — f(P2 - q)|. 
(9.67) 


Here 


1 if |P| < P 
f(P) = i ae 
0 if |P| > Po. 


The introduction of the last factor (involving the f(P)) enables us to dispense 
with the comments after Eq. (9.66) explaining the limits of integration. Now the 
integrations can go from — oo to oo. The sign of each amplitude is determined 
as follows. For Fermi-Dirac statistics (the present case), the sign is —1 for 
every closed “‘matter’’ loop. For Bose-Einstein statistics the sign is +1 for 
every closed matter loop. Figures 9.10(a) and (b) are each considered as one 
closed matter loop. Figure 9.11 is considered as two closed matter loops. 
Hence the sign of the first term in Fig. 9.5 is (+) and the sign of the second 
term is (—). To sum over spins for the second diagram of Fig. 9.5 gives another 
factor of 2 in Eq. (9.67). 


OQ CO 


Fig. 9.10 Each of these is considered as one closed matter loop. 


OU 


Fig. 9.11 Two closed matter loops. 
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9.9 HIGHER-ORDER PERTURBATION 


Consider the diagram of Fig. 9.12. It should be noticed that the interaction is 
always wave vector g. The perturbing energy contributed from this diagram 
is calculated (including the sum over spins) as 


(—ayrgs [ (2me#?)? dq (d2Pi (d2P2 (d5P3 ys 
Vq? } (2n)° J (2n)* J 2n)° J (22)? 


—m —m 
x Ms! aaa 2 ~ by] 
(4q7 + 4° P,) + Gq? + 4° P.) Gq? +4: P,) + Ga? + @° Ps) 


(9.68) 


where the integration is over the appropriate range. 

Because of the existence of the powers of g in the denominator, diagrams of 
this type diverge when integrated over g, even with the restriction on the range 
of integration. However, the final quantity of interest is the sum of contributions 
from these diagrams, and the final result is expected not to diverge. In order to 
arrive at the final result without getting involved in the divergence, 1/q? is 
first modified into 1/(q¢? + e?) using a small ¢, the integrations are carried out 
including e, and at the last stage ¢ is brought to zero. 

The worst divergences comes from diagrams such as those of Fig. 9.13. 
These are called the sausage diagrams, and the q’s are the same for all inter- 
actions shown by dashed lines. 


Consider the integrals 
d3p 
(2n)° 


F(t) = F(p)(1 — F(p + q))e” "2 +4) (9.69) 


and 


ail ae dt,::: dt, (t, + t3 +°°: 4+ t,) 


— 00 — 0 


x F(t )F (tz) °° * Fa(tn)- (9.70) 


The simplest case of A,, is 


A, = an dt, ° dt, {a a te 1197/2 44° Pig lt2| (47/2 +4 P2) 
d*p, d°p» 
(22)° (2n)° 


fos d°p, d°pr 1 7 —|t1|(q2+q°(p1+p2)) 
= | ay? fem? oe 


3 3 
_ | oP: | ee : (9.71) 
(2x)? } (22)? q?> + 9° (Pi + Pr) 
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Fig. 9.12 Diagrams of this type diverge when integrated over q. 
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Fig. 9.13 Sausage diagrams. 


where all integrals are over the appropriate ranges. 
Thus we see that the contribution from an n-loop diagram is 


d3q (2ne*h*\" 
C= A,(q)(— 1)". 9.72 
om ( = @-1) (9.72) 
In order to simplify A,(q), use the integral expression of the 6-function 


6(t) = | git au 


ae 2h 


so that Eq. (9.70) becomes 
1 du 


A, = 7 | a oe dye OF (EF gta) *** Felt) 5 


_1 | =| { aero], (9.73) 
n | 20 
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Use Eq. (9.72) in Eq. (9.71) to write the sum from all sausage diagrams as 


2ne7h 
= : 1)" 
-¥ es ms ada) r) (-1) 
(~ 1)” 2ne7h itu 
[esi py . ar Vq | ate F wo] 


d?q (du 2ne*h 
—In} 1 + dte'™F (t 
(2n)° iE Vq? | or ah )) Sel 
The integral over | dt--- has been worked out for small g and is 
; d*p __2Gq° + q'P) 
Ou) = | der (t) = 44 A EY Fp) 1 — F(p + 
q a Gadi ope (p)L (p + 9] 
~ An ( —  tan-! ‘). (9.75) 
q u 
For small g, 
An 
Qu) ~ —-. (9.76) 
3 u? 


Thus Q,(u) is finite for small g. 
The final answer for the contributions from all the sausage diagrams is 


2 
2d) iy ee I (9.77) 
nt 1 { R? dy 2 
where 
R=1-—ytan' y, 


9.78 
a = (4/9n)1/?. ae 


r, 1s the average spacing of electrons measured in units of the Bohr radius. It 
should be noticed that a In r, term appears in Eq. (9.76). The final result for the 
total energy is | 

_ 2.22 0.916 


Ps rs 


+ 0.0622 In r, — 0.096 + O(r,). (9.79) 


Here, the last constant term — 0.096 is the exchange term calculated apart from 
the diagram sum. 

It should be remembered that the mathematics in this section summed 
only the sausage diagrams as shown in Fig. 9.13. Although it is true that each of 
these diagrams causes the worst divergence, the theory would not be complete 
if the rest of the diagrams like the one in Fig. 9.9 were neglected. Gell-Mann 
and Brueckner examined the contribution from these diagrams also and —0.096 
is the result; as was expected, it is smaller (for small r,) than the contribution 
from the sausage diagrams, but it should not be neglected. 


CHAPTER 10 


SUPERCONDUCTIVITY 


10.1 EXPERIMENTAL RESULTS AND EARLY THEORY 


Just about any material can be brought into a superconducting state, in which 
there is no measurable electrical resistance, by cooling it below a certain critical 
temperature. That temperature depends on the material; it gets as high as 
~ 21°K for Nb,,Al,Ge. Although the vanishing of the resistance is perhaps 
the most spectacular effect in superconductors, we can perhaps find clues to the 
cause of superconductivity by examining other effects. One such effect is the 
discontinuity in the specific heat at the critical temperature. 

The specific heat less the aT* contribution of the lattice gives the specific 
heat of the electrons, C,. Comparison of normal elements with superconducting 
ones gives the two curves shown in Fig. 10.1. For the normal element C, 
varies as T, because the thermal energy is kT and only those electrons within 
an energy range kT near the Fermi surface can be excited. Thus, the thermal 
energy varies as T°. 

Integrating the specific-heat curve shows that the superconductor is lower 
in energy than the normal element. 


Superconductor 
Normal element 


Fig. 10.1 Specific heat of electrons as a function of temperature for a superconductor 
and a normal element. 
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Meissner Effect. A changing magnetic field produces an electric field, so 
something with zero resistance cannot contain a changing magnetic field. Ifa 
superconducting sphere is placed in a magnetic field, the field lines will be forced 
away from the sphere. If a ring is cooled in a magnetic field until it is a super- 
conductor, the flux through the ring will remain after the external source of the 
field is turned off. Apparently the resistance is exactly zero, for experimentally 
the flux remains constant indefinitely (provided the ring is not allowed to 
warm up). 

More surprising is the Meissner effect. If a solid (simply connected) piece 
of superconducting material is placed in a magnetic field and then cooled below 
the critical temperature, the magnetic field is pushed out of the superconductor. 
Technically, some lines might be trapped in the object, because some parts 
reach the superconducting state before others. Furthermore, if the magnetic 
field is strong enough, it might not be pushed out at all. In such a case, the 
material does not become superconducting. Its resistance and specific heat are 
normal. Because of its magnetic domains, iron cannot be cooled into super- 
conductivity. 

The Gibbs function is defined asG = F + PV where Pis the pressure and V 
is the volume. Here the pressure can be taken to be the energy per unit volume 
required to push the field out. From classical thermodynamics, the Gibbs 
function has the property that in a reversible change of phase at constant 
temperature and pressure, G does not change. So the critical field is characterized 
by 


2 
baler V = Ga rerond = Gigenal =F normal: (10.1) 


F supercond — 
ST 


At the critical temperature, H,, = 0, and as the temperature decreases Fy oima 
increases, so that H,, increases. 


T, 


Normal 


Her 


Fig. 10.2 Variation of critical field with temperature. 
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London observed, on the Meissner effect, that if » = density of electrons, 
mass = m, charge = —e and electric field = £, then acceleration is given by 


—eE = mx, 
and the current density j is 
—nex = jf. 


In a magnetic field of vector potential A (if we use cgs units), 


gives a rate of change of current density of 


dj _ _, 94 
dt at 


Hence, 
—A(A ms Acne) = J 


where A = ne*/mc = constant. Aon is constant in time, but may vary with 
position. It is fixed by the Meissner effect, so that no arbitrary magnetic fields 
can be put into the superconducting region. 

London proposed that A,,,, be taken equal to zero for superconductors. 
We then must satisfy the boundary condition that j,oma1 = 0 at the surface of 
the superconductor, so that A,o.maj = 0. We do this by an appropriate choice 
of gauge, called “transverse gauge.” 

We conclude then that 


j = —AA. (10.2) 


Equation (10.2) implies a modification of the statement that there is no 
magnetic field in a superconductor. Since 


Vie oS Ae (10.3) 


there is no sudden drop in the magnetic field to zero when we enter the super- 
conductor. In one dimension, for example, 


A o exp(+ /4nA|c x), 


where the sign in the exponential is chosen so that A decreases as the distance 
into the solid increases. The magnetic field penetrates to a depth of order 


700 A (/ c/4xA). The existence of a finite penetration depth can be determined 
experimentally by measuring the diamagnetic susceptibility of small drops of 
superconducting material, or by working with thin films. 

The above theory suggests where the A might come from and approximately 
how big it should be. But remember that electrons in a metal are not free, so A 
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is not exactly ne?/mc. The theory can be made to correspond with reality better 
by taking j = —AA’, where A’ is A averaged over the position using an appro- 
priate function. For example, in one dimension the average might be taken as 
follows: 


A’ = normalization constant x [ Aoyeeor dy. 


For “hard” or “type II’’ superconductors, € is much smaller than the penetration 
depth. For “‘soft’’ or “‘type I’ superconductors we have large €. Impurities make 
a superconductor hard. 

A further contribution by London came from consideration of the quantum- 
mechanical electric current, which is —e times the probability current: 


— he 
2im 


y*vy — (Vh)*Y). 


j= 


In a magnetic field the momentum operator becomes p + e/cA, so that the 


current is then 
“inte to tye-[Cr eT 
2m i Cc i c 


i 2 
=f Fyeyy — ww] — £4 vy. (10.4) 
2im mc 


~ 
| 


With A = 0 we get 
. he 
Yc t a LWoVWo Pe (Vio) Wo] = 0. 
2im 
When A ¥ 0, W_ changes into wy. If W & Wo, then Eq. (10.4) gives 


joe tiie 22 ee (10.2) 


We see then that if the wave function is “‘rigid”’ (that is, if it does not change 
when A is introduced) then Eq. (10.4) implies Eq. (10.2). What could cause this 
rigidity? 
From perturbation theory, we have 
(n|H pert|O> 
= Le spree emaeeerirsansl (977 
ie Cage E, — Eo ee 


where E, is the ground-state energy and E, is the energy of an excited state. If 
there is a gap between the ground-state energy and the energy of the first excited 
state, then E, — E, is large and f X Wo. 
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T. 


Fig. 10.3 Variation of energy gap A with temperature. 


The assumption of an energy gap can also explain the anomaly in the specific 
heat. Instead of the energy varying at 7”, a gap would cause it to vary as 
e A/kT where A is the size of the gap. More direct confirmation of the existence 
of a gap has been afforded by experiments involving microwaves. The energy 
needed to excite the material across the gap could even be measured as a function 
of temperature, and A decreases as the temperature increases. 

It took almost fifty years for the problem of superconductivity to be 
reduced to that of explaining the gap. In what follows we will explain the gap 
following the theory of Bardeen, Cooper, and Schreiffer. This theory is essenti- 
ally correct, but I believe it needs to be made more obviously correct. As it 
stands now there are a few seemingly loose ends to be cleared up. 


10.2 SETTING UP THE HAMILTONIAN 


The energy gap is A ~ kT, ~ 107° eV/electron, which is a small quantitative 
effect to produce a large qualitative effect. Thus the Coulomb correlation energy 
is too big, and can therefore be neglected. 

If this reason for neglecting Coulomb effects seems odd, remember that we 
are not trying to explain and predict everything about the solid. We are just 
trying to understand superconductivity. If some effect is associated with an 
energy that is too large, then we know that effect is not the cause of super- 
conductivity. Similarly, as we know that the energy associated with super- 
conductivity is small, we can predict that certain phenomena (such as e* e7 
annihilation) are not going to be affected much by superconductivity. 

If we change the isotope out of which the superconductor is made, the 
critical temperature changes. Spin-spin and spin-orbit interactions do not 
change with changes in the isotope, so they should be neglected. 
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The speed of sound is definitely dependent on the mass of the atoms of the 
superconductor. It is plausible then to guess that superconductivity has some- 
thing to do with interactions between the electrons and the phonons. Let us 
try, then, a Hamiltonian of the form 


H = » Exdx ax + py OpCe Ce + y Mxw Cr - 4K: 4K 


K,K 


+ Y Mk Cg KOK ax: (10.5) 
K,K’ 


ag creates an electron, Cj creates a phonon. é is the energy of an independent 
electron of momentum K, and wx (f# = 1) is the energy of an independent 
phonon. We measure the energy of the electrons from the Fermi surface, and 
holes are treated as electrons with negative ¢. The last two terms of Eq. (10.5) 
represent the interaction between phonons and electrons. Usually this inter- 
action is taken to be responsible for resistance. High resistance at normal 
temperatures implies high M, which in turn implies a special propensity towards 
superconductivity. 

There are many effects that cause large energy changes, but which are easily 
understood and have nothing to do with superconductivity. For example, the 
effect represented by Fig. 10.4 changes the energy of the electrons by an amount 
larger than the gap, but does not produce a gap. 

The diagram in Fig. 10.5 just corrects the properties of the phonon. 


Fig. 10.4 An effect that causes large energy changes. 


Fig. 10.5 An effect that corrects the properties of the phonon. 
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Fig. 10.6 A two-electron process. 
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Fig. 10.7 One electron distorts the lattice, the other is affected by the distortion. 


We see, finally, that the superconductivity must involve more than only 
one electron or one phonon. We must consider diagrams such as that in Fig. 10.6. 
Physically, this diagram can be interpreted as a consequence of one electron 
distorting the lattice, which in turn affects another electron. Figure 10.7 shows 
that two electrons K, and K, interact via a phonon, causing K,; and K; to 


come out. For the momenta the relations are 
K; =? K, = Q, 
K, = K; = Q. 


The initial energy is EFjnitiaa = x, + x, 1n the intermediate region, and the 


virtual energy is 
Evictial = €q, + EK, Ba hg. 
The perturbation energy due to the mechanism of Fig. 10.7 is 


] 
os , * 
V; — Mxx;, Mxix, 


Einitiat = FE virtual 
I 


ney |, 6 
(ex, + &,) — (ex, + &k, + hOg) 


— Mx,x;, 


(10.6) 
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K, 


Fig. 10.8 Another two-electron process, of similar type to Fig. 10.7. 


Figure 10.8 shows another mechanism of a similar kind. The perturbation 
energy coming from this mechanism is | 


1 
(€x, + Ex,) = (Ex, +- exs + hag) 


V, — Mx. x; Myex.,: (10.7) 
For the calculation of the perturbation energy, the initial state and the final 
state have the same energy: 


eK, + ex, = ox; + eKs or ex, = eKs = —(éx, —= Ex; )- 


The perturbation energy is the sum Vx: «;-x,x,- We are mainly concerned 
with electrons near the Fermi surface, so we can take all ¢’s to be approximately 
equal. Then 


VikiKo:KiK, ~ — — (Maox;Mxix, + Mxix;Mx3x,)- (10.8) 
Q 
If the M’s are approximately equal, the perturbation energy is negative; so the 
electrons near the Fermi surface attract each other. 

In metals, the Fermi surface is curved. If we disregard the curvature of the 
surface, the Hamiltonian can be treated exactly. However, when we do that we 
find no superconductivity. Therefore the curvature of the Fermi surface is 
essential for superconductivity. 

The Coulomb interaction among electrons gives the interaction energy: 


Ane? 
Veoulom’ = = (10.9) 
couiom? "1K, — Kil? + (const)? 
which is always positive, so the interaction is repulsive. The “(const)*” is a 
consequence of shielding. In order to achieve overall attractive interaction 
between electrons we need 


V a Vesnionis < 0. 
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Fig. 10.9 A case where |K, — Kj| is large. 


Looking at Eqs. (10.8) and (10.9) we see that for overall attraction when 
Ex, © &;, larger values of |K, — K;| are more favorable. For example, look at 
the case shown in Fig. 10.9. 

We can summarize these results by writing a new Hamiltonian for the 
electrons alone. The phonons have the effect of modifying the €x for the electrons 
and of modifying the interaction between the electrons so that the interaction is, 
under certain circumstances, attractive. We will be working with the 
Hamiltonian 


= + + + 
H — >. Exax Ax + De V ki K5:K Kp 4K, K34K,4,: (10.10) 
K KK;K1K2 


Our problem will be to discover how Eq. (10.10) can lead to a ground state with 
especially low energy. 


10.3 A HELPFUL THEOREM 


Consider a Hamiltonian 


The eigenvalues and the eigenfunctions of Hy are written as E; and @;. 
U;; = (9;, U¢;). 


If all the £;’s are nearly equal, and also all the U;;’s are nearly equal, then a 
large amount of lowering of energy can be achieved. This is shown as follows: 
Try a wave function 


y= py ai; 
and evaluate the energy expectation value for this state: 
—E= » E,\a,|* + » U,;aj‘a;. 
J 
Following the assumption let us put 
Ey Eo; U;, ~ —V. 
The normalization of the a;,’s is 
2 la;|” = 1. 


Then 
C= E,—-VY aja. (10.11) 
tJ 
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Suppose there are m states of the required nature; what is the best choice of 
the a,’s that makes € of Eq. (10.11) a minimum? It is* for 


when Eq. (10.11) becomes 
é = Eo —* my, 


Thus, when V is positive, or U;; is negative, the stabilization is m-fold intensified. 


10.4 GROUND STATE OF A SUPERCONDUCTOR 


With the theorem of the previous section as a guide, let us proceed to find a set 
of such ,’s for which the U;j,’s are nearly equal and negative, and the E;’s 
are all nearly equal. 

In the k-space, a wave function is defined by the configuration of occupancy 


and vacancy of states. For U;;, consider 


(Kas Kp... +5 ky, k|V|k,, hp, ..., ky, ke), 
where 


= + + 
v= ™ kik3skik2 %K{>%K3 Uk, Fk: 


We can consider this sum to be restricted to, say, ki > k, and k, > ky. 
Remember that a,,4,, = —4,,4,, and aj,a,,; = —ay.aj:. The two states con- 
sidered differ only in k;, k; and k,, k,. This matrix element is 


(Kegs Kips «oy Ky Key|V Keay Kegs 5 Ras ho) = EV esesstaey (10.12) 


The + sign comes from the ordering of k’s, and the rule is easy to obtain from 
Fermi statistics. The sign is (—1)%*™% where N = number of states between k, 
and k, in the function describing the initial configuration, and NV’ = number of 
states between ki, and k, in the function describing the final configuration. 
For example, 


(ky ki, Kp, LALZE k,,, Kp, k,) = (= 1) ed Ae 


Bardeen, Cooper and Schrieffer proceed to show that if a particular set of 
configurations is chosen, we can make V,.4+ 4,4, in Eq. (10.12) all real and nega- 
tive, and the sign in front of it always positive. The idea is to take the states in 
k-space always in pairs. This means that the pair of states k, and k, are either 


* To prove this, use the Schwarz inequality: 
Vata = |Y1-a| < YP La, 
ij je i Jj 


with equality if (a,,a2,...) < (1,1---), that is, if the a,’s are equal (and may be 
chosen real.) 
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Fig. 10.10 k-states arranged with pairing states side-by-side. 


both occupied or both empty, and the configuration in which one of them is 
occupied and the other empty is not allowed. When we make this requirement 
on the configurations we can show that the sign in Eq. (10.12) is always positive. 
Suppose all the k-states are arranged in such an order that the pairing states sit 
side by side as shown in Fig. 10.10. Using the rule (—1)%*™’ and the fact that 
N = N’ = O, we see that the sign is always positive. 

Because Vy: %,~, Should not vanish, we require for every pair 


ki + k, = k, + k, = p = const. 
This constant p can be chosen as any vector, but for the ground state we choose 
p = 0. 


Thus, the pairs we consider are 


How about spins? In computing the potential, we included the diagram in 
Fig. 10.12a but neglected Fig. 10.12b, plus and minus. The sign is + for states 
that are symmetric in space, that is, for states that are antisymmetric in spin. 
To make the potential as large and negative as possible, we take the states with 
antisymmetric total spin, that is with total spin equal to zero. We choose 


oe) = —d;.- 


> 


/ 
X 


Fig. 10.11 Pairs in the k-space. 
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(a) (b) 


_ Fig. 10.12 Included and excluded states in the potential calculation. 


To simplify the notation, the pair (kT; —k1) is denoted by “k” in this 
section. Consider all the pairs in the k-space and arrange them in a certain order 
kK, k,, Kk, eee 
A pair k; is either occupied or unoccupied. 

Let us define: 


|p,(0)> = the wave function for kt and —k{| both unoccupied, and |gx(1)) = 
the wave function for AT and —kJ| both occupied. 


Then a wave function that is a possible candidate for the ground state is 


> = [1]. (10.13) 
where | 
Wid = U,le.)> + Vi1@,(0)) 
and 
|Ugl? + [Vyl? = 1. 


The actual ground state is some linear combination of states of the form given 
in Eq. (10.13), but we will make the simplifying assumption that with appropriate 
choice of U, and V,, Eq. (10.13) is the ground state. By adjusting the phase 
of |g,(1)> and |g,(0)), we can take U, and V, real. Our program will first be 
to find the ground-state energy by varying U, and V, = fr 1 — U,. Then we 
will find the energy of the excited states, which will turn out to be a finite, 
macroscopic amount above the ground state. 
The energy of a candidate for the ground state is 


E= W\H|y> = » CWle,ay a, + 6 4a~,a_4lW> + » Vien Wage a= yaa _,|W) 
| (10.14) 


where all sums are taken over half the k’s and where Vj. = Vy —4sm,—4- Then 


E= > &,< Wy, lag ay + a*,a_,|Wy> + de Vien § Wael Wad ae 0 yp lV Wie. 
k ; 
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Let 
Sy = (Wylag ay + A= ,a-41W) 

and (10.15) 
th = (Wylag a7 l,>- 


Then 
= >: £5, + be Vevitete (10.16) 
k k’,k 
If 
Wi = U;|9(1)) + Vilp.(0)>, 
then 


Sk = DU. ty = U,V, = fe. 


Y 26,02 + Y Ve Ue VeVi Vee 
k k’k 


by 
| 


(10.17) 
The U,’s for the unperturbed state are 

Gs for k < Kegermi- 

U, = 0 for k > Keemi- 


The best choice of the ground state wave function is obtained by minimizing 
Eq. (10.17) with respect to the U,’s, V, being a function of U,. In the minimiza- 
tion it is not necessary to fix the total number of electrons explicitly. By varying 
U, and V,, we vary the number of electrons. If we choose an appropriate zero 
of the energy ¢,, we wind up with the correct number density of electrons after 
minimizing the total energy. We will take the zero of the energy to be at the 
Fermi surface, so that é,,,..; = 0. 


10.55 GROUND STATE OF SUPERCONDUCTOR (CONTINUED) 


Following our program we must next minimize Eq. (10.17) with respect to the 
U,’s. A set of equations results: 


U2 
4e,U, + 2 » Vung Ue Vig (% = “a — 0. (10.18) 
k 
Because V,,' is real and negative, we can define 
A, —_ y Vin Ue Vue > 0 (10.19) 
and we write Eq. (10.18) as 
1 — 2U; 
2¢,U, = A, - (10.20) 
V1 — U2 


Introduce x as 
Ux = 41 + x) (10.21) 
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so that 
1— U2 = 41 — x) 
and (10.22) 
1—2Uz = —x. 
Square Eq. (10.20) and use Eqs. (10.21) and (10.22) to solve for x’: 
2 
2 eK 
x=, 10.23 
E2 (10.23) 
where we define 
E, = Ve2 + A2. (10.24) 
Taking the right sign we obtain from Eq. (10.23): 
(1.2902) 24 4-2." (10.25) 
E, 


so that for e, < 0 (below Fermi level) x > O and U, > V,, as it should be. 
Putting this back into Eq. (10.21) we have 


(10.26) 


U2 is the probability that the pair k is occupied. The form of U2 in 
Eq. (10.26) shows qualitatively that the distribution U is a rounded Fermi 
distribution. Assuming A, is small, we see 


U2 -» 1 deep inside the Fermi sphere (¢, < 0) 
and (10.27) 
U2 — 0 far outside the Fermi sphere (¢, > 0). 


These limits suggest the distribution shown in Fig. 10.13. 


{ 
t+— Unperturbed distribution 


Uy, in Eq. 10.26 


Fig. 10.13 The rounded Fermi distribution. 
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~~ Lee” 


Fig. 10.14 The range of nonzero V4’, as assumed by Bardeen, Cooper, and Schrieffer. 


To complete the calculation we have to solve for A,. If we are to use Eq. 
(10.19), we shall find U,V,. Because Eq. (10.26) gives 


2 
U,V, = ; ‘i _~+ = 2s. (10.28) 


Eq. (10.19) can be written as 


— Y Vge ——_. . 
eo 9/62, + A2, 
This is the equation to be solved for A,. 

In order to obtain some insight, Bardeen, Cooper, and Schrieffer assumed 
Vix is a constant for k and k’ within the range fw,, above and below the Fermi 
surface, and zero otherwise. See Fig. 10.14. When we make this assumption, 
A, of Eq. (10.28) becomes a constant independent of k, and the equation for it is 

a 
KE 2s/e2 4 A? 
Changing >°, into an integral over ¢, we have 


h@cr 
4 ee de = Vimo | 
—hOcr Je + A? aa + ae 


M(e) is the density of states for energy ¢; it is close to a constant (0) near the 
Fermi surface. Solving for A, we obtain 


(10.29) 


1= -V 


1 = 


_ hw,, 


~ sinh(1/|V|M(0)) ’ ate 


10.6 EXCITATIONS 


Now we are ready to find the excitation energy. To describe excited states we 
will have to consider the possibility of a given pair of momenta being half- 
occupied. Define 


0,(2) = Ax, P(0), 0, (3) = ax 4, 9,(0). (10.31) 
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Then |w,,> is some linear combination of the four following orthonormal states: 


lW,(0)> = U,p,(1) + V,,(0) = the appropriate |y,) for the ground state; 
lW,(1)> = V,o,(1) — U,,(0) = |W,) for the excited state of a pair; 
lW.(2)> = o,(2) = |W,) for a single excitation, with —k| unoccupied; 
lW,.(3)> = @,(3) = |w,> for a single excitation with kT unoccupied. 

(10.32) 
If Wy, = Px,(2) and Wy, = U,9,(1) + V,.@,(0) for k # k,, then the energy that 


we will call E’, is given by Eq. (10.16). Take E, as the ground-state energy. 
Then s,, = 1 and te, = 0, so 


Ev — Ey = &, — 2e,,UK, = d Vue (UV n View) — Mm Vert Veg UVa) 
k 


1 — 2U;.) cara 2 ( MatUsM) OV i: 


We have used the fact that Viz, = View = Vix. Using Eqs. (10.19), (10.25), 
and (10.28) we see that 


2 2 
E’ — Ey = &, & } x a) (5) -'et4n op (10,33) 
E,. 2F,, E,, 


If Wy, = %x,(3), 54, = 1 and 4, = 0, so Eq. (10.33) still holds. If yy, = 
Vi, Pu,1) — Uy, ,,(0), the energy becomes E”. It is easy to show that s,, = 
2V es tk, = ae Viger and 


E" — Ey = 2E,,. (10.34) 


Note that F,, > Ay,. 
Suppose W,, = 9,,(2) and Wy, = ,,(2). Then, if we are careful not to 
count V,,,, terms too often, we get 


E = Eo = Ex, + Ex, + Viet ic View Vico ~ Ex, + Ex 
because V,..4,UK, Vie, U.Va, 18 an infinitesimal quantity. Thus, 


E 


two excitations 


— Eo = Ex, + Ex: (10.35) 


If Wy, = Ve, Pu) — Un, Px,(0), it is plausible to consider the excitation to be 
a double excitation, with both k, and —k, excited. 

We conclude that there is a gap between the ground state and the excited 
states, and that excitations consist of the breaking of pairs. The excitation 
energy is the sum of the E, for each excited electron in each pair. 
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10.7 FINITE TEMPERATURES 


To find the energy of a system at a finite temperature, we will use: 


Expectation value of the energy = £ 


= ) (Probability of state i) x (energy of state i). 


Describe a state by the product JJ, |W,,(7;)>, or by the sequence n,, n2,..., 
where n,; = 0, 1, 2, or 3. 

If P,(n) is the probability that |W) = |W,()>, then the probability of a 
state is []; P,,(”,;). Of course, >, P,(n) = 1. Then 


E 


> II P,(nj) b €%,5~(N) + > Veertentt Mths, (Nm } 


ni, n2; ees 


¥( ET] Pate) E Palmdensnin 


+d 1m 2; At P, (nj; a), YY Pr ( 1p) Phe) Viertinttey Mt k(Min): 


not nm, His 
But 


Yeo) =-1Ys) =T]1 = 
not ny a JFL nj j#l 


and similarly, 


p» J P, (nj) = 1. 


Of m1, an 


Thus, 
E= d » Py (M)E,, Se (2) + 2 2 Py (N)Px,(0 Verh! Mt y(t’). (10.36) 


Suppose we call f,, the probability that a k state is excited. Then the prob- 
ability that a given pair of k states is unexcited is 


P,(0) = (1 — f,)’. (10.37a) 
The probability that one of the states (say k) is excited, and the other is not, is 
P,(2) = P,(3) = fl — fh). (10.37b) 


The probability that both states of a pair are excited is 


P,(1) = f2. (10.37c) 
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From Eqs. (10.37) and (10.15), 
S,(0) = 2U x, t,(0) = U,V,, 
S,(1) a 2V is t,(1) a —U,V,, 
S,.(2) = S,(3) = 1, t.(2) = t,(3) = 0. 
Thus, from Eqs. (10.36) and (10.37), 


E= d 2e,[UK — 2f,) + fA] + DX View Uae Var UVa — 2f, A — 2f;,). 
(10.38) 


Here we assume that Uj + Vz = 1, but we do not assume that U, is such that 
F is a minimum for zero temperature. U, and f,, are functions of temperature. 

For a given set of orthonormal states U,, and f,, define a set of P; = prob- 
ability of state i. Then 


E =), EP; 


S 


—k Dy P;\n P; (k = Boltzmann constant) 


F(P) = E — TS. (10.39) 


If the P; are proportional to e~ ¥*/*", then F(P;) = free energy. It is easy to 
prove that if F(P;) is minimized with respect to P; (subject to the condition that 
> P; = 1), then the P; have the correct values for temperature T. 

Using the method of Lagrange multipliers, we find that 


Then P, = elAlkT— 1G Ei/kT 

A must be chosen so that >’ P; = 1, in which case the P; are correctly 
given for equilibrium. | 

For the case of superconductors at a finite temperature, minimizing F with 
respect to P; is equivalent to minimizing with respect to U, and /f,. 

Before we do this, note that we are really considering an F that has an 
undetermined number of electrons. We then minimize subject to the condition 
that the expectation value of the number of particles is a given number. So, for 
example, P,; should be written P(n,, n2,...) where 7,, 1,,... are the number of 
particles in state 1, 2,... respectively. In addition to the condition that 


yo P(m, m,...) = 1 


M1,N2,.-.. 


we have 


YY (my, + ny + ny +°-+)P(ny, M2,...) = fixed number. 


n1,N2; eee 
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The method of Lagrange multipliers then gives what amounts to the chemical 
potential of Chapter 1, Section 6. When working with superconductivity, we 
do not explicitly write the chemical potential because it is taken care of by an 
appropriate choice of the zero of the energy. 


S = entropy = —k ¥ probability of state i x In [probabilty of state i] 


—-k¥¥ P,(n) In P,(n). 
It follows that 
TS = —2p7 » [fiinf, + -f) Ind - AJ] 


5F 5E 6TS 


Se Se Os 
= 2é,(1 = 2U2) = 4) Vig’ U,. Vie U,V, oars 2h) + 2p} aa: It x 
~ Sk 
(10.40) 
Set 
Ey = &,(1 = 202) — zZ > Via Ue Vie UVC ae hi) (10.41) 
k’ 
Then 


fe = Af(e?** + 1). (10.42) 
If we now take 


SF _ 6E 


ei = | 4avs + 2% Vice U ye Vi (%- vale = 1) (1 — 2f,), 
(10.43) 


we have (in principle) equations that can be solved for both U, and f,. Now, 
define 


A, —_ — d Vine Ue Vie —- fy): (10.44) 
Then Eq. (10.43) becomes identical to Eq. (10.20). All our results from Eq. 


(10.20) through Eq. (10.28) are true at finite temperatures. Equation (10.41) 
becomes 


6, = &(1 — 2U2) + 2U,V,A, = & ie OR Re cae ip (10.45) 
= 2E, 


where we have used Eq. (10.44), then Eqs. (10.25) and (10.28), and finally 
Eq. (10.24). 
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The final result is 


A, — = y; Vien’ ae tanh 5 
. (10.46) 
Ez = e2 + Az. 
To simplify this equation, assume 
Viucn’ — —V < 0 for lé,| < ho,,- 
Then A, becomes independent of k, and Eq. a reduces to 
hOcr 
— Lae eel (10.47) 
M(0)V 0 E. <— 


where M(0) is the number of states per unit energy at the Fermi surface. The ‘‘2” 
in the denominator of Eq. (10.46) drops out because f°,,,.. de and [fr de are 
equal. When T — 0, Eq. (10.47) gives 


h@cr 2 
~ sinh-? er _ 9 { deexp[—ve? + WUKT] yy gg, 
M(0)V 0 Ve? + A? 
Assuming M(0)V is small, we find 
at wines AO) =H AatT 0. 10.48’) 
M(0)V A(0) ° 


It is easy to see that as J increases, A decreases. There cannot be a value for 
A at all temperatures, because in the limit of high temperatures we get an equa- 


tion independent of A: 
1 | Mer de hh, 


M(O)V Jo 2kT  2kT’ 


and this equation may simply not hold. 

In Eq. (10.47), A appears only as a square (in EF = Ve? + A”). The effect 
of decreasing A is to decrease E(e). But E(e) cannot be lowered below the value 
it has for A = 0. The critical temperature, then, is at A = 0, above which there 
is no solution for A. Putting A = 0 in Eq. (10.47) we have 


hOcr 
Bag) Sah 22 ein 1 (10.49) 
M(0O)V 0 |6«CE 2kT., kT. 
Equating Eqs. (10.48’) and (10.49), we get, 
2A(0) = 3.52KT.. (10.50) 


We write “‘2A(0)”” because in experiments in which a superconductor is excited, 
we create both a hole and an electron. Equation (10.50) is experimentally 
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A/kT, 


T/T, 
1.0 


Fig. 10.15 A(T) as a function of 7|T,. 


C.s/YT 


T/T 


Fig. 10.16 Specific heat of a superconductor, C,,, as a function of 7/T,. 


fairly good. Let T = T, — t. As Tt increases from zero it is clear that if Eq. 
(10.47) is to remain true, E must change from ¢ by an amount proportional to Tt. 


But E = Ve? + A? we + A?/2, so A? is proportional to t. That is, if T ~ T,, 
A? = A(T, — T) 
for some positive constant, A. We conclude that the general shape of A(T) is 


as in Fig. 10.15. 


Another check with experiment is the specific heat whose electronic part 
is of concern to us. The normal specific heat C,,, is proportional to 7: 


Co, = 71, 


e,U 


y being a constant. For the superconductor, C, , is plotted in units of yT,. It is 
as shown in Fig. 10.16, which is in agreement with experiment. 


10.8 REAL TEST OF EXISTENCE OF PAIR STATES AND ENERGY GAP 


Any phenomenon in which scattering of electrons is involved will serve as a 
test for the existence of the pair states. Attenuation of phonons and para- 
magnetic relaxation are examples. 
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k’o’ 


ko 


Fig. 10.17 Electron scattering. 


In such phenomena the interaction part of the Hamiltonian includes the 
form 


H’ = > Burg’ koFk'a' Akos (10.51) 
ko,k’o’ 
where o is the spin coordinate. Equation 10.51 represents the amplitude for an 
electron with ko being scattered into k’o’, as shown in Fig. (10.17). When the 
pair states proposed in the BCS (Bardeen, Cooper, Schrieffer) theory exist, a 
scattering of an electron kT induces an interference with the paired electron at 
—k. 
The coefficient B usually has a symmetry like that shown in Fig. 10.18. 
There are two cases to be distinguished: 


Buia! ko = Boppy yh ag! (Case I), 


(10.52) 
By at De pig ae (Case II). 


o’ ;sko 


In the ordinary scattering case we write a table: 


kT k't 
Initial occupied empty 
final empty occupied 


That is, we consider the scattering of an electron from kf to k’f. 

For the BCS case, the table should include kT and —k| together. When we 
speak of the scattering of a kT electron into k’T, we mean that we take the initial 
wave function with w, = @,(2) and Wy, = U,-o,(1) + V,-g,(0). Then final 


Fig. 10.18 Symmetry of the coefficient B. 
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wave function is W, = @,-(2) and Wy, = U,@,(1) + V,@,(0). Then the matrix 
element for the transition 1s 


(WolA' |W) = (Oe(2)1( U0.) + ViP (0) (Bre stey Meet Let 
+ B_4y—4/\@~4nj4-K')) 
x [p(2)) Up Gy) + Vi-p,0)) 
= BuetViVer + Bony Guu: 
= Busnes VN +£ UU y:). (10.53) 


The + sign gives the two alternative cases (I) and (II) in Eq. (10.52). Changing 
the sign from plus to minus causes a big difference in the scattering. Qualitatively 
we can understand the change when we consider the case k’ ~ k. Then we see 


V24U2=1; V2—U2=1-—2U2 =¢,/E, = &/Vez + AZ. (10.54) 


When the sign is positive, the scattering is enhanced, whereas when it is negative 
the effect is reduced. 

Phenomena such as attentuation of sound, thermal conductivity, and 
paramagnetic loss have been examined and the theory can explain the experiments 
well. 

Let us now discuss some gap experiments. Suppose we form a junction by 
placing a superconductor with a thin insulating layer (such as an oxide of the 
superconducting material) against another metal, which can be either a normal 
metal or a superconductor. Ifa voltage is placed across the junction, the amount 
of current will be affected by the presence of the gap. Semiquantitatively, we 
can set up the problem as follows: 

Label the metals 1 and 2. Electrons in a state described by “‘k’’ in metal 1 
will have an amplitude to go to state “gq” of metal 2. Let that amplitude be a 
constant, M, in the energy range that interests us (near the Fermi levels). The 
probability of transition from state k of metal 1 to some state of metal 2 is 


= y |M|? (probability of state k being occupied) 
q 
x (1 — probability of state g being occupied) x d(E, — E,), 


where E, and E, are the energies of the respective states. The probability of 
state k being occupied is 


1 


f(Ex) — exp [E,/kT] + 1” 


and similarly for state g. If voltage —V is applied to metal 2, the energy of 
state q becomes E, = E, — V, and the probability of g being occupied is the 
same as before, namely f(E,) = f(E, + V). 
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The current from metal 1 to metal 2 is 


ane S IM 2f(E, [1 — f(E, + VY) (Ey — E’). 


he kq 
If 
I = the total current = current from 1 to 2 — current from 2 to 1, 
we find that 
2me 
I= mo IM|*{ (ELL — f(E, + Vd) — f(E, + V)LL — f(E)]} 6(E, — E%) 
Q 


= iM? Y Lf(E) — fEq + VY AE, — EY) 


of 


Now x,q can be replaced by | dE, dE{p,(E,)p2(E,), where p,(E,) and p,(E,) 
are the density of state functions in the respective metals. Then 


KT) = 7% imp | dE[f(E) — f(E + Vy]p(E)px(E + V). 


At absolute zero, 


I(0) = am IM |? | ” dEp,(E)p,(E + V). (10.55) 


Thus the current depends on the density of states near the Fermi level, E,, 
for the two metals. For normal metals we can consider p(E£) to be roughly 
constant, but for superconductors the interactions between electrons cause the 
spectrum to be distorted. The state that would have had energy «, without the 
interaction actually has energy |E,| = V2 + A2. For simplicity, take A, = A 
independent of k. Thus there is a gap around E, = 0. While dN/de (which is 
the density of states if there were no interactions) is roughly constant, 


aN|_ aN 1 dN El 
dE, de, |dE,/de,| de EE? — A? 


p(E,) = 


rises to infinity as the gap is approached, then drops to zero within the gap. 

If metals 1 and 2 are normal, p,(£) and p,(E + V) are constants, and J is 
proportional to V. But if metal 1 is a superconductor, p,(£) is zero for all E 
in the integral of Eq. (10.55), unless V > A. Thus for V < A there is no current. 
For V > A, it is easy to show with our simplifications that J oc Jv2 — A?, 
and looks like the function in Fig. 10.19. At finite temperatures, the curve 
looks like the function in Fig. 10.20. 

This situation, with metal 1 a superconductor and metal 2 normal, can be 
visualized as shown in Fig. 10.21. At V = 0, Ep, is the same as E,,. When V 


lowers E,, more than A, below E;,, current can flow from | to 2. 


10.8 Real test of existence of pair states and energy gap 289 


O Ay 


Fig. 10.19 Current across a junction with one superconducting metal at zero temper- 
ature. 


Ay i 


Fig. 10.20 Current in the same junction at finite temperatures. 


Unoccupied { 
levels 


Unoccupied 
Gap <p, levels 
Er, 
Filled 
levels 


1 (superconductor) - 2 (normal) 


Fig. 10.21 Levels in a junction of two metals, separated by an insulating layer. Metal 1 
is a superconductor and metal 2 is normal. 
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Fig. 10.22 Current in a junction of the superconductors, at zero and finite temper- 
atures. 


Suppose both metals 1 and 2 are superconductors. The current as a function 
of voltage looks like the curves in Fig. 10.22. From the second curve in Fig. 10.22, 
both A, and A, can be obtained. For more details concerning the energy gap, 
see Progress in Low Temperature Physics, Vol. IV, page 97 and on (especially, 
page 140 on). 


10.9 SUPERCONDUCTOR WITH CURRENT 


In Section 10.4, we assumed that the electron pairs had zero total momentum, 
so thatk, + k, = 0. The more general situationisk, + k, = 2Q = constant 
for all pairs, in which case there exists a current. As before, we can describe a 
pair by a single momentum vector, k, but in this case, 


ki, =k+Q, 
k, = -k + @. 


Formerly, the energy of a pair was ¢, + &_, = 2&. But 


2 2 27,2 2n2 
© ae or + (-k + OF = 20 4222. 
2m 2m 2m 2m 


SO 
7 + oy = QE, + 2(h?Q?/2m). 


In analogy to what we did before, we define V§% as Vi+9,-n+9:%'+0,-k' +9" 
Notice, however, that V (2) = V,,, + terms of the order Q? or less. The change 
in V,,' of order Q? is similar to the doppler frequency shift in sound emitted 
from moving objects. Neglecting ““Doppler shift’ we modify Eq. (10.17) only 
by the replacement of 2e, by 2e, + 2h?Q7/2m. The energy of the ground state 
is increased by (47Q7/2m) >, 2U}. 
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The number density of electrons is ” = { 2U%, and we can define v = 
hQ/m. Then 
excess energy = WV(mv/2). (10.56) 


1= (v|7*p\v). 


where p is the momentum operator. It follows that 


The current is 


wae 
marae 4 
m 


J= Thy TE + Q)+ (-k + Q)U; = ~¢ "2 y 22 = — Vev. 
m ‘k m ‘k 
(10.57) 


How is the wave function changed by replacing pairs +k with pairs 
+k + Q? The equations for U, can be brought up to date by replacing 
g, with e, + h?Q7/2m. As well as 4?Q7/2m, we must add another constant 
to make the zero of the energy appropriate for the correct number density of 
electrons. The net result is that the zero of energy is again at the Fermi surface, 
and the U, come out the same as before. But now the U, describe the amplitude 
for +k + Q to be occupied. 

Suppose we describe the ground-state wave function by Wo(x,, x5,...). 
We want to modify this wave function to become Wo(x,, X2,...). This modifica- 
tion is effected by replacing each component of momentum p by a component of 
the same amplitude, but of momentum p + AQ. That is, e'?«'*e/*" is multiplied 
by e'2°*e, Then 

Wo = exp (2 => «.) Wo = exp [ a > a Wo. (10.58) 

Equations (10.56) and (10.58), it should be noted, represent simply the 
consequences of a Galilean transformation of Schrédinger’s equation. According 
to our approximations, a superconductor with a current is the same as a super- 
conductor in its ground state as seen by a moving observer. 

In a real metal the current can vary from place to place. Because k is 
large near the Fermi surface, the lowest-energy wave functions are not affected 
much in energy if the variation in current is slow. 

To see how to deal with variation of current with respect to position, 
assume a metal is divided into regions. Each electron is constrained to move 
within a given region, and for each region there is a different current. Then the 
wave function becomes 


.m .m 
Ww = exp(i— v,° » Xe} eXP[ lt — v2° > Xe)°°* Wo 
h electrons h region 2 
in region | 


e 


exp (" » (x) ° «. Wo. (10.59) 
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We can now let v(x,) vary gradually with x,. But this is not quite right. To see 
why this modification of w is inaccurate, remember that the equation for current 
density involves derivatives. The fact that v(x,) is not constant modifies the 
current. 

To improve our description of a current in a superconductor, let us next 
try 


vy = exp (1TH) Yo 
The number density of electrons is given by 
WAR) =F [Wi Ble, — BY dry dea 
and the current density is given by 
HR) = LM | [wtVeb — (Wap) exe ~ RY dy des 


If we remember that the current for ¥ = Wo is zero, we find 


KR) = - ve A vo. (10.60) 
m 


For example, if 0 = mv- x/h, j(R) is the same as that for Eq. (10.57). 

Note that we have only described currents for which V x j = 0. For the 
steady state, V-j = 0. 

In a ring, the above conditions can be satisfied with nonzero current. In 
this case, $j -ds = —We(h/m) AO, where (AQ) is the difference in 6 as we go 
around the ring. A@ need not be zero, but it must be a multiple of 27, for we 
require that the wave function be single valued. If we define v = —j/We = 
(A/m)V@, then 


e -ds = 2mn a ; (10.61) 
m 


where v is an integer. If the ring’s central hole shrinks to a point, the only way 
n can be kept greater than zero is to have lines along which the superconductor 
breaks down. 

By calling » = —j/We, we have implied that this quantity can be thought of 
as a velocity. This interpretation is confirmed by computation of the expected 
value of the energy. 


10.10 Current versus field 293 


p= [vty [v|z5 shutvly 

- Fy [var va + [yevy 
2m ‘e 

=) [ar [ovat VY) (a, — R) + |yve 
Mm e 


WV = WoVWo. (10.62) 
Also, 
Vi* Vb = VWs Vio + iVO- [WoW — WoVWo] + (VO) WoW. 


Equation (10.62) becomes (if Ey is the energy for YW = Wo): 


E 


E+ > | d?RiVO(R) | Y [WoVevt — WEVeo] d(x, — R) 
m e 


, | d°R|VO(R)|? Y Wowod(x. — R) 
2m e 


2 
Eo + t | enna "Jo(R) + = | aR ivace? rae) (10.63) 
e m 
But jo(R) = current for ground state = 0. Then 
E=E,+ | Ramo? N(R), (10.64) 


as we would expect from the interpretation of v(R) as a velocity. 


10.10 CURRENT VERSUS FIELD 


In the equation j(r) = —AA’(r), A’ is averaged over some region about the 
point r. In other words, we must use a sort of “spread-out” A(r). We can 
interpret this spreading as an effect of the size of the electron pairs, and we can 
calculate the spreading by means of the BCS theory. 

Empirically we have 


iu (ido gL ea 2) ed 


lr — r'|* 


(10.65) 
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where C is an appropriate normalization factor (so that constant A gives 
A’ = A). Note that Eq. (10.65) has the form of a convolution. If we write 


= [amen d*r, A(q) = | Anenies d3r, 
(10.66) 
Ka) = [Kine a'r, x) = EO 


r+” 
then 


i(q) = A(q) - K(q). (10.67) 


In what follows we will find the component of j(q) that is in the direction 
of A(q). 

Suppose that A(r) is in the z direction and has magnitude A je~‘4'*, where q 
is perpendicular to A (we are using a gauge in which V: A = 0). Then 


Aq’) = | Ace dx <= Ao Oq.q’> 
where everything is normalized within a unit volume. 

In Section 10.1, we showed that if the wave function is “rigid,” jp = —AA. 
In so far as the wave function is not rigid, we get a contribution from the 
paramagnetic current of 


jo a weve — WY fot =F + do. 
2im 


Set 4 = 1. Byconsidering w as an operator (as in second quantization) we could 
get the operator for j(r). But we will use a slightly different approach. 
For a single particle, the operator for j,(r) is 


ir) = — [P, or — ry) + 6 — 1,)Py], 
2m 


where r, is the operator for position and r is a vector (a set of three numbers). 
Then 


OO | Goes | (White)Wye™" dr 


=e { { ir, rb(ry) 5 V, dr — r,) + or —r,) 4 v,| Wren, 
2m 1 i 
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After some manipulation we find 
iq) = — Piet" + ee PW). 
2m 
Summing over all particles, we get 
= b —e ag | a a | —i “FI 
i@) = Liq) = ae Cy » [Pe i" +e" P]l>. —(10.68a) 
l 


By, say, ©, e @'"P,, we are representing the operator, acting on all the particles, 
that on one particle is e ‘4°"P (r and P are operators), Using 


ear — | axe tte 
and 
d°P 


P= 
(2n)° 


P|P)<P| 
it is easy to show that 


e9°p = kik — q)<kl. 
k 


In Chapter 6, Section 8, we learned how to find the corresponding operator on 
arbitrary numbers of particles. The result is 


ye arp, = ¥ kat(k — q)a(k). 
I k 
Similarly, 
Devers 7 py (k — q)a(k — q)a(k). 


Equation (10.68a) becomes 
: —e 
IQ = — WL Ok — q)ag_ gal. (10.68b) 
2m k 
Because g is perpendicular to the z axis (i.e., to A), 
—e 
JQ) = aie cyl »» kay g4\W>- (10.69) 


The wave function |) can be easily found by means of perturbation theory. 
The Hamiltonian for the superconductor with no field has a term 


y Pi 


Tr 2m 
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This is modified to become 
2 
t. D2 G ap a ; 
2m ‘1 Cc 


Using the fact that V- A = 0 and neglecting terms of second order in A, we 
see that the Hamiltonian is modified by 


H’ = — y A(x;) : P, — . > Aget’*'P, — Ag >; k,Qp+qA- (10.70) 
mc ‘1 mc ‘1 mc ‘k 
The wave function is 


> = o> + » Ann) 


where 


f = + 
An = (n|H |O» = eAo (nl De k Ay +44|0> ; (10.71) 
E, — E, mc E,, — Eo 


Here we are using |) and |wW,,) interchangeably. |W,» is the ground-state wave 


function, described by Sections 10.4 and 10.5. Putting together Eqs. (10.71) 
and (10.69), we obtain 


jg) x = K Y, keat_qaul0) + YAR Y kaa gaul0) 
+ DAKO Y kak guln) | 


We will calculate the third terms of the above result, and it should then be easy 
to see that the first two give zero. 


+ 
(0 F k,ag-ga,ln) = Cal (x kata) 10)". 


From 


+ 
( kak) = Vi kagay-g = Y kAk+q% (because k, + q, = k,) 
k k k 


and setting k + gq = k’, we find 


2 + 2 
e€ > |<n| Dk k,a,:a,,|0)| (10.72) 
mvc 


jAQ) = E. — Eo 


The ground-state wave function is a product of terms such as 


W,(0) = U,|kT, —kL> + V,|0, 0. 
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k,ax:,4,, operating on w,(0)W,(0) gives k,U,V,-|0, —kL>|k'T, 0>. In other 
words, k,a;:,a,, takes the state 
kt, —kl kt, —k’l 
occupied empty 
into the state 
(—k), (k’T) kf, —k'| 
occupied empty 
with amplitude k,U,V,:.. We can get a similar final state by using the operator 
—k,a* 4,44. This operator takes 
kt, —kl kt, —k’I 
empty occupied 
into the same state as that found by using k,a,,,a,,, but with amplitude 
: 7 k, U V,.- Then 
e Ao y |<k and k' excited|(k,a;°,4,, — k,a*,;a_,\)0>|? 
m2 C k E, + Ey: 
= e° Ao x kU Vier — Vie ye 
mc k E, + Ey 


iAQ) 


(10.73) 


The algebra in the above derivation is not rigorous. For example, the signs 
have not been unequivocally proved. The reader should do the algebra himself 
to check such things as the correctness of the signs. It is important to be con- 
sistent; decide, for example, whether |kT, —k| is a,a~,,|0,0) or a*,,a,',|0, 0). 
The following steps are straightforward: 


(U,V, = VU) = U.Va. + Vue —_— 20, V0. V qe. 
From Eqs. (10.26) and (10.28), the right-hand side becomes 


1 1— Ay Aye EnEK: 
2 E,E, )- 
and Eq. (10.73) becomes — iAg) = K,(q)4,(9)) 


2 
K..q) = x k; ie Ay Ag EEK: 
ae ck E, + Ey E, Ey 


with Ex = Vez + A2 (Eq. (10.24). 

Bardeen, Cooper, and Schrieffer find what amounts to K(q) for arbitrary 
components and nonzero temperature. They also show that at zero temperature, 
the result approximates the empirical formula Eq. (10.65), with &) = 
0.15 to 0.27 in place of the factor 0.18. Considering the approximations we 
(and Bardeen, Cooper, and Schrieffer) made, the theoretical results are very 
good. 
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10.11 CURRENT AT A FINITE TEMPERATURE 


To compute the energy of a superconductor with a current and at a nonzero 
temperature, we must modify Eqs. (10.36) and (10.37) as shown in Table 10.1. 


Table 10.1 


What was formerly 


Becomes 


€,5,(0) = 2¢,U? 
€,5,(1) = 26,V? 
€,5,(2) = &% 
€45,(3) = & 


P,0) = (1 — Ad — fy) =  — A)’ 


PA) = Si 
P,(2) = fxd — Sy) 


PB) = A — fi) 


The energy is 


(énaq + &K+Q)UR 
(x49 + é_pig Vx 
&K+Q 


E_k+Q 


(1 — fe+g)1 — f-x4+9) = Probability 
that neither (kA + Q) nor (—k + Q) are 
excited. 


Sr+of-n+o 


fu+gQ( — f-x+g) = Probability that 
(k + Q) is occupied (excited) and 
(—k + Q) is not. 


f-n+00 — Suro) 


E= d Gd — frog — F-n+o&+o + &_44Q)U, 


a Lier of-1+0(e+0 + & pig Vi 


ate dL Sev — fexsgek+g + di S-n+0) — frrgle-x+0 


she d Vie UM UieVe lO — frog — f-n+9) - Sx+ol-rio)] 


x [1 — ferro — f-x+0) — Su +ot-n +] 


-_ » [(Ex+0 + 8_p+9)U x1 — 2fnro) + ex+ofc+0] 


k 


a > ViQU Mi Ue Viel = 2fi+QU ~ fk’ +9): 


k,k’ 


(10.74) 


Here we have used U, = U_,, Vie = V_, and Vie’ = V _ age = Vie—k’ 
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So far, we know neither what f, is nor what U, is. As before, we must 
minimize F = E — TS, where 


TS = —B VY Y P,(n) log P,(n) 
— pt d [feito 108 fargo + f-n+0 log f_-x+o 


+ (1 — fxig) log 1 — fatg) + U - f-x+q) log — f-nio] 
— 2p7' d [fiero log fr+g + U1 — fe+g) log (1 — fx+g)]. (40.75) 


If we vary with respect to U, and let 


A, = - » Vie Uy a 2fx +0): (10.76) 


then we get an equation similar to Eq. (10.20). In fact, if we neglect errors of 
order Q7, we get Eq. (10.20) exactly, and everything from Eq. (10.20) to Eq. 
(10.28) is again true. More generally, we can define e(k) = 3(& 49 + &-n+Q); 
and Eqs. (10.20) to (10.28) hold with e(k) in place of é,. 

Next, vary with respect to f,49. The result is 


0 = —2e(k)U2 + e490 + 2A,UV, + Bo? log Jk+2 


1 — ferro 
= tsg — ok) + Ey + Bt log 22 a4 
~ Jk+@Q 
2]. 
= MEO. b+ p- log Sete, (10.77) 
m 1 — fro 


from which we find 


1 
exp {[E, + #(k- Q/m)]p} + 1- 


Equation (10.78) is what we would expect from a Galilean transformation 
of velocity v = 4Q/m. 

We might also expect that the current should be the velocity times the total 
density. In fact, this is not the case. The actual current is less than this, for the 
energy of excitations causes them to prefer to move in such a way as to oppose 
the flow. 

The effect described above should not be confused with “back flow,” 
which is instead an effect that comes about as follows. Consider the problem 
of a wave packet of excited electrons in a superconductor. Such a wave packet 
can be described by an approximate wave function, which, in turn, can be used 
to find energies and localized currents. But it turns out that the approximate 
wave function first used did not conserve current. An improved trial wave 


Seto = (10.78) 
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function (which does not improve the energy much) does conserve current. 
“Back flow”’ is the additional current that arises from the correction to the wave 
function. 


Now, let us see how much current we get from the wave function we have 
chosen for a uniform current in a superconductor. 


Electric current x m(—e) = momentum 
= » Probability of state i x momentum of i 


= d » P,(n) x contribution of the + k + Q states to the expectation 
value of the momentum 


a » d — fir gi - fens QU ilk a Q) a (—k : Q)| 


+ dX Suro — f_n+g(k + Q) + dL f-n+0( — furo(—k + Q) 
+ dL Surof-n+oVi [A + Q)+(-k + Q)] 
=@ d [2UE — 2fero) + eral + » K(fu+o — f-n+9) (10.79) 


The term multiplying Q can be written as 


OY 2Ui + OV — UDC) = v2 + 20¥ furore 


(see Eq. (10.25)). 


Dx Sn+ge(k)/E, is roughly zero. To see why, consider f,,,9 and é(k)/E,, as 
functions of energy, e. (See Fig. 10.23.) f,49 is roughly symmetric about the 
Fermi energy, and «(k)/E, is roughly antisymmetric. Their product is roughly 
antisymmetric; so it integrates out to zero. 


Sta 


e(k)/Ey 


Energy 


xe 


Fermi energy 


Fig. 10.23 The functions f,4 2 and e(k)/E,. 
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The term in Q is therefore, .@Q. The other term in the current is 
ee 
(2n)° exp {LE, + (k- Q/m)]B} + 1 
Up to first order in Q, the integrand of this term is 


dK Sa+0 “7 f-x+0) = » 2kfi+o = 2| 


k hk. kk: Ofm)BeEx? 
exp {[E, + (k- Q/m)|B} +1. ef + 1 (eFkB 4 4)? 


The only contribution after integration comes from the second term in the 
integrand, which by symmetry gives 
d*k k(k- O/m)Be®*? 2BO ( d*k — k?eFk? 
2 ae elle BLA J neal mpl 2 
(2n)> (eF*F + 1)? 3m J (27)? (ef 4 1)? 


Summarizing, we see that Eq. (10.79) leads to: 
j= -N ev, 
where WV, = effective density of superconducting electrons = VW — JW,,; 
a 
N = 2 | ——, U; = numerical density of electrons, 
(27) 
_ 2B ( d*k k*eEKb 


An = 3m | (m3 (eFe + 12” 
Q 
v= =. 
= (10.80) 


As the temperature tends to zero, f approaches infinity, .y;, approaches 
zero, and jy, tends toward sw. jy; comes from the excitations, and can be 
considered as a number density of normal (nonsuperconducting) electrons. 

To help us see how yw; behaves, let us make some approximations. 
Suppose that the integrand of jy; is appreciable only near the Fermi surface in 
k space. This assumption is a consequence of the assumption that f,.9 con- 
tributes only near the Fermi surface, as in Fig. 10.23. Then we can replace k? 
with k2, and 


dk 
(2n)° 


— [- M (0) dé, 


where M(0) is the density of states ate = 0. We have 


2 dE 
M(s) de = Ank* dk — 4nkmkdk _ 4nkm 


“Qn? Qn> mm (20 
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Therefore 
M(0) = 4nk,;m 
(2n)° 

Also 

_ 4 nk; 

3 (22)? 
The factor of 2 comes from spin. 
Putting all this together yields 
00 F Eb 00 ‘4 PAV 1 +x? 

Nn = BY a ( Ex + 1)? N ve xB (eb Av + x2 1)? = Ny(BA) 


Here y is a function defined as 


© zv1+x2 
e@ 
W(z) = | 


ep ate ea ete dx. 
agen 1) 


We already know that at absolute zero, y = 0. At the critical temperature, 
A = 0, FE, = s,and y = 1. In other words, sy, = Oat the critical temperature, 
and there is no superconducting current. 4, is given qualitatively as a function 
of temperature by Fig. 10.24. 

Without much effort we can generalize this derivation. Suppose that instead 
of simply minimizing F, we minimize F subject to the condition that there is a 
fixed expectation value of the momentum. We then do not get a stable state, 
but instead a state that will decay into a stable one. An outline of the procedure 
follows. 

Where before we minimized F, we now minimize F + V,-<P)>. The 
components of V,, are Lagrange multipliers with the dimensions of velocity. 


Ns 


N 


T, 


Fig. 10.24 Number density of superconducting electrons N,, as a function of 
temperature. 
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We fix the value of (P» at the beginning of the problem, and at the end of the 
problem we adjust V,, so that <P» has the right value. The result is 


pou yoyo (ee k 
m | (2m) exp {[E, + k-(V — V,)]B} + 1 


where V = Q/m. We have already worked with the integral in this equation, 
and it is easy to see that 


j= -NVeV + N,V — V,) = —-V eV — N,eVq. (10.82) 


Another result obtained using the same ideas Is 
Energy (at fixed entropy) = 41 ,mV* + 4W,mV?2. (10.83) 


It can be shown (with some difficulty) that V,, is the velocity of wave packets in 
the superconductor. 

Equations (10.82) and (10.83) can be summarized by the following slightly 
nonrigorous statement: 


In a superconductor there are two types of electrons—normal electrons, 
which carry currents that decay due to resistance, and superconducting electrons, 
which “‘short circuit” the current carried by normal electrons. 


This statement is misleading because jy, and ;, do not really count 
electrons. _j,,V is defined in terms of a certain integral, and jy, is given by 
N — WN; Ina similar manner, liquid helium can also be thought of in terms of 
a two-fluid model, and here, too, the model is satisfactory if used carefully, 
although it should not be taken too seriously. 


10.12 ANOTHER POINT OF VIEW 


Schr6dinger’s equation can be found from a variational principle of the form 
OO |- d°x dt = 0, (10.84) 


where the Lagrangian density is 


LW, Ww", Vw, VW", Ww") 


[CHT [ap ere tes 
m l Cc l c l 
(10.85) 


Finding y and w* such that { & d°x dt is an extremum is equivalent to solving 
the equations 


id hee of (10.86) 


5L/5y* = 0 
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where, for example, 


5 _ 0G dd’ Ad <s 


oy OW dt GW 1 dx, HOW/ax,) 
Using Eq. (10.87) with Eq. (10.86), we get Schrédinger’s equation and its 
complex conjugate. 


If we write y = J pe’, we can consider ¥ as a function of p and @ and 
their derivatives. Then 


(10.87) 


oY _ LO 5 OY 
dp oy dp dw* dp 


5L/50 = 0. 


(10.88) 


and 


In terms of p and 0, 
1 A\? h? 7 = 
Psp WO) a eG RGR ae eS. 
2m Cc 2m 2i 


It follows that 
tmv? + qV + U = —hib, 
(10.89) 
V-pv+ p=0, 
where 


v = (1/m\(4V0 — gA/c) and U= —(h?/2m)1/Vp V2Vp. (10.90) 
Incidentally, 


castes )e [ee y =r 
2m 1 Cc i Cc 


is the probability current, not the electric current. 

Now, how does all this apply to superconductivity? In superconductors, 
the electrons in the ground state come in pairs with opposite spin. The linear 
combination of such pairs with the lowest energy can be considered a particle. 
Since the spin is zero, the “‘particle’’ obeys Bose, statistics, and the ground state 
of a superconductor has huge numbers of “‘particles’’ in the same state. If p is 
such that w is normalized, Eq. (10.89) describes the motion of a single “‘particle,”’ 
but because the “particles’” are bosons, any number can be in the same state 
and p can be the number density. 

Note that g = 2e and m = twice the effective mass of the electrons. Note 
also that if p is the number density, we are obliged to modify Y in some way so 
that p does not deviate too much from some value characteristic of the material, 
say p,. We must add to ¥ a term 


a _ 2 
5 Ps)’ 


10.12 Another point of view 305 


a does not come from electrostatics, for the electrostatics specifies only that 
N;+ M, = constant, where jy, = p. We will not discuss « any further. 
Equation (10.89) can be replaced by the equations 


Vee ae 
ot 
mo + (v- | = q(E + v x B) + VU, (10.91) 
where 
pee. B=Vx A. 
c Ot 


Equations (10.91) are merely the equations of hydrodynamics. The first 
equation is the equation of conservation of the fluid, and the second is F = ma, 
with U interpreted as a pressure. Because of the «/2(p — p,)* term in Y, U is 
no longer given by Eq. (10.90). Just as Maxwell’s equations describe the motion 
of many photons in the same state, Eq. (10.91) are macroscopic equations for 
many superconducting “‘particles’’ in the same state. 

From Eq. (10.90), mv + gA/c = AVEO is a gradient; therefore its curl is 
zero. If B = V x A is zero, V x v = O and the fluid is irrotational. From 


Eq. (10.90), 
ho, — 6,) = |. (me + - ds. 
1 C 


If point 1 = point 2, we get 


A(2nn) = (me + a ds = = 1o(m= + 4) ds 
Cc q 
_ re 
= t pas + A)-ds, 
c 
where n is an integer and A’ = mc/qp, = —2m,c/2ep,. 
1. 2nhe 24 . 
(Aj + A)= —- n = —(2.09 x 107° gauss cm*)n. (10.92) 
e 


Hereafter, replace n by —n. 
For a simply connected superconductor (i.e., one without holes) V and A 
are continuous everywhere; son = 0. It follows that A = —A‘j.* But suppose 


* This equation is the same as Eq. (10.2). Remember that the j of Eq. (10.2) is the 
charge current, not the probability current as above. 
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the superconductor is in the form of a ring. The field does not penetrate very 
deeply into the ring (see Section 10.1), so there is no current deep in the ring. 
For a path of integration near the middle of a reasonably thick ring, 


2.09 x 1077n = pins + A)-ds = bards = || Bas (10.93) 


Equation (10.93) predicts that the flux through a ring is quantized, and this 
prediction is confirmed by experiment. Furthermore, if g were e instead of 2e, 
the coefficient of m would be twice as large as experiment gives. 

There are some other, very interesting, applications of the point of view 
introduced in this section. Consider an insulator between two identical pieces 
of superconductor (Fig. 10.25). 


If Ax is sufficiently large, the two superconductors do not affect each other, 
and 


h. 
a Wy ae E.W, 


h . 
3 Wo. = E,W). 


E, can differ from E, if there is a voltage across the insulator. If Ax is very 
small, the phenomenon of barrier penetration allows some of , to leak into the 
region of ,, and vice versa. Since Schrdédinger’s equation is linear and homo- 
geneous, the rate of leakage into region 1 is proportional to w, and vice versa. 
It can be shown that 


h. 
= Wy = Ey, + ao, 


P (10.94) 
oe Wo = E,W, + ayy 


where a is real if there is no vector potential. If there is a vector potential, 


a, = Ao eligA/hc): Ax 


Superconductor 
Vy 


Superconductor 
Vo 


| ax + 


Fig. 10.25 An insulator, width Ax, between two identical superconductors. 
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We will take a real, and leave the more general case for the reader to work out. 
If we write w = Vp e’®, Eq. (10.94) yields 
h 1, , fo ae pest , 
—" —_ 6, + Aby/p, = Ep, + av'p, exp [i(0, — 0,)] (10.95) 


and a similar equation with “1” and ‘‘2” exchanged. Equating real parts of 
Eq. (10.95) we find that 


P 


and exchanging “Il”? and “2” gives 


ho, = a jes COS (0, — 0,) + E,. 
P2 
Similarly, 


: 2a. fp. 
Pi, = —a he Vp1p2 sin (0. — 0;) = — pp. 


j = —/p, = the current that flows through the junction. Define 


Z Z 
0 = (0, — 4;), jo = 5 VesPa © 


If p, & po, we get 


J — Jo sin 0, 
dé _ E,—E, _ qv _ 2eV (10.96) 
dt h h h 2 


where V is the voltage across the junction. 


Now, suppose we construct the device shown in Fig. 10.26. If 6 = 6, — 6, 


he Junctions Srameengt 


Fig. 10.26 A two-junction device. 
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and 0’ = 0, — 6, then 
Jot = Current that flows freely through the device = jp(sin 6 + sin 6’). (10.97) 
Inside the superconducting part of the loop V8 = (1 f)(mv + (gA/c)), so 


2mn = change in @ as we pass around the loop = 6’ — 6 + ; (me + 3 ‘ds. 
c 


But if the wires and junctions are thick enough, v is small, and 
Inn = 6' 6+ 2OA-ds=5'—-54 22, (10.98) 
he hc 
where g = | B- dS is the flux through the loop. 
Equation (10.97) can be graphically represented if we notice that 
Jot = the imaginary part of j,(e + e). 


In the complex plane, 7 behaves as shown in Fig. 10.27. A small voltage can 
cause 6 and 6’ to change with time, but 6’ — 6 is a function only of the flux 
through the loop. With a given flux through the loop, the maximum value 
Jeo CAN Have is by simple geometry 


= 2j, cos 2 . (10.99) 
Ac 


Jmax = 2jo COS C=) 
2 
We predict J,,,, to vary periodically with g as shown in Fig. 10.28. The distance 
between successive maxima represents a vary small flux (of the order of 107’ 
gauss cm). Something like the graph of Fig. 10.29 is actually observed. 

Figure 10.29 differs from Fig. 10.28 in three respects: 


1. the scale is different; 
2. as increases, the oscillations die out; 
3. a nonzero lower bound arises for the oscillations as @ increases. 


The above experiment is analogous to the double-slit experiment in optics. 
In optics, if the slits have finite width, the intensity must be multiplied by the 
diffraction pattern of a single slit, and for large angles the oscillations die out. 
In our experiment, there is some magnetic field passing through the junction 
material, so 6 and 0’ are not uniform along the junctions. The spreading of 6 
along the junction corresponds to the “‘width” of the slit in the optical double- 
slit experiment, and it has a similar effect. 

The nonzero lower bound that appears as the oscillations die out is a 
consequence of the fact that v, the velocity of the electron pairs, is nonzero. 
The physical apparatus as pictured in Fig. 10.30 does not really have its left 
side identical to its right side, so $ v d, need not be zero. 


10.12 


Another point of view 
Jina 
o 
Fig. 10.28 Theoretical periodic variation of maximum current jmax. 
Jmax 


Fig. 10.29 Experimental variation of jmax. 


309 


310 Superconductivity 


If the magnetic flux through the loop is produced by a solenoid through the 
loop, the oscillations do not die out because no magnetic field passes through 
the path of the current. The fact that varying @ does cause j,,,, to vary shows 
that it is the vector potential, A, that directly affects the motion of the electrons. 
The B field was invented in order to describe the motion of objects in terms of 
something that exists at the position of the objects. Here the vector potential 
serves this purpose. 

We can run a current through the bottom film of superconductor in the 
device pictured in Fig. 10.31. If the bottom film is doubled on itself no magnetic 
field is produced in the loop. But now we have added a term 


m m|{. m : 
; [> ds hip, J: ds hap. qj: ds 
to the phase, 6 — 0’. 

The integration path is along the bottom film of superconductor. By 
measuring the electric current needed to produce a given shift in the position 
of the maxima of the oscillations, we can find m/fip,q. Recall that A = p,g/m. 

Another experiment we can perform is to rotate the device pictured in 
Fig. 10.30. If bis a vector, 


db\ _ (db 
dt a dt as measured in a system 
rot rotating with angular velocity w 


So, for example, if the position of a particle is called r, 


dr 
v)ot = {| — =v—@x FP. 
(V) ot Sh. 


AV, 04 d(v) 
ee (prise 52) = —“— — @ x (0), 
es ( dt ). dt (rot 


as iy Se se 
dt dt = 
= 2 OK” [@)ror + @ xr] —@x (Y )rot 
= at 20) X O -—@ X (@ X Pr). 
Neglecting second-order terms in «, we see that in the rotating frame there is 
an added effective force of 
q(¥ rot x 2m@c : 
c q 


that is, an effective magnetic field B = 2mwc/q. 


2m(v )rot x @ 
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Oxide Superconductor 


Fig. 10.30 The “‘double-slit” apparatus. 


Fig. 10.31 The same, with the bottom film doubled on itself. 


Going back to quantum mechanics, we see that this effective magnetic 
field causes an effective A to appear in the Hamiltonian. In our experiment 
with the rotating loop, the maximum current varies as the angular velocity is 
varied. By means of this experiment we can measure q/m. 

Devices related to the one described above may have many uses. They 
react so quickly that there are rumors that one can detect radio signals by the 
vibrations of the magnetic fields. More likely, such devices might be useful for 
measuring small magnetic fields simply by counting the maxima of the current 
oscillations. Perhaps they could be used to maintain fields (or rather fluxes) 
constant to a high degree of precision. Maybe they would be useful for informa- 
tion storage in computers. There may be circumstances in which they could be 
used as a strain gauge, by keeping B fixed and detecting changes of area. 


CHAPTER 11 


SUPERFLUIDITY 


11.1 INTRODUCTION: NATURE OF TRANSITION 


Liquid helium* undergoes a transition at 2.18°K, which may be demonstrated 
in a spectacular manner. When normal liquid helium, known as Hel, at the 
boiling point is cooled to 2.18°K, the boiling abruptly stops and the quiescent 
low-temperature modification, liquid helium II, (HeII) appears. HelII does 
not boil. This abrupt transition also can be demonstrated by specific-heat 
measurements (see Fig. 11.1). The shape of the specific heat curve resembles a 


c 


T (°K) 


Fig. 11.1 Specific heat of helium near the transition temperature. 


Pressure 


T (°K) 


Fig. 11.2 Phase Diagram of helium near the transition temperature. 
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(2? and as a result the transition temperature 2.18°K is often called the “A- 
point.” The specific-heat curve on both sides of the A-point is given empirically 
by (see 1.9.) 


e= a + binc(|T — T,)), 
Ge 

where a, is the constant relevant for T > T, (Hel) and a. for T < T, (Hell). 

This result has not been obtained theoretically and remains one of the unsolved 

problems of superfluidity. At very low temperatures the specific heat is propor- 

tional to T°, a relation that is understood. 


Properties of Liquid Helium: Liquid helium below the /-point, that is, Hell, or 
“superfluid” helium, exhibits several remarkable properties. 


1. As has already been noted, Hell does not boil (although there is evapora- 
tion). This can be explained by assuming the heat conductivity of Hell to be 
essentially infinite. 


2. More remarkable than the apparent infinite heat conductivity is the zero 
viscosity (under certain conditions) or superfluidity of Hell. It has been shown 
that below a critical velocity V,, Hell flows through a thin capillary or “‘super- 
leak’ with zero resistance. Furthermore, V, increases as the capillary diameter 
decreases. Observing flow through a capillary is not the only way to measure 
viscosity, however. If a cylinder is placed in a liquid helium II bath and rotated, 
there is a momentum transfer from the rotating cylinder to the helium, indicating 
that under the conditions of this experiment, the viscosity is not zero! These 
viscosity experiments can be crudely explained if it is assumed that Hell is 
composed of an intimate mixture of two fluids—one fluid with zero viscosity 
and density p,, and the other with normal viscosity and ‘‘density”’ p,.* Thus it 
is the zero-viscosity component that flows through capillaries and the normal 
component that interacts with the rotating cylinder. To explain the observed 
data, we must assume that p,/p, is a function of the temperature. 


3. A third remarkable property of HelII is the “thermomechanical” or 
“fountain effect,” together with the related ‘““mechanocaloric effect.”’ 


Consider two containers of Hell connected by a superleak (Fig. 11.3). 
Let the density, p, and temperature 7 be kept constant on each side. Then 
there will be superfluid flow through the superleak until the pressure head AP 
is equal to AP = ps AT. s is the specific entropy. This temperature difference 
giving rise to a pressure difference is known as the thermomechanical effect. If 
one container is a thin tube the pressure head will cause a fountain of liquid 
helium (Fig. 11.4). 


* The reason for the quotation marks about ‘‘density”’ will become evident later. 
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He II 


Fig. 11.3 The thermomechanical effect. 


: He IT 
Heat \ 


Fig. 11.4 The fountain effect. 


If the containers of Fig. 11.3 are kept at a constant pressure and there is 
mass flow from A to B, container B will cool. This is the mechanocaloric 
effect. If the two-fluid concept is extended, so that now it is assumed that the 
superfluid component not only has zero viscosity but also carries zero entropy, 
the thermomechanical and mechanocaloric effects can be explained both qualita- 
tively and quantitatively. or example, the cooling of container B in the 
mechanocaloric effect is exactly what one would expect if the mass transferred 
had zero entropy and hence zero temperature. 

The superfluid flow of a zero-entropy, zero-viscosity fluid also explains the 
abnormal heat conductivity of Hell. 

Most of the other “‘superfluid” properties of HeII can be explained in terms 
of the properties described above. For example, liquid helium II placed in a 
beaker will creep up the surface of the beaker and spill over the sides. This will 
continue until the beaker is empty (it being assumed throughout that the tem- 
perature of the helium is always below the J-point). This rather peculiar behavior 
can be explained by the ordinary physics of evaporation and the infinite thermal 
conductivity and zero viscosity of Hell. 

If any normal liquid is placed in a beaker we would expect that, due to 
Van der Waals attraction between liquid and beaker molecules, a layer of 
liquid, whose thickness decreases with height, will be formed on the beaker 
walls. In all liquids but Hell, however, the layer formation is inhibited by small 
but finite temperature, and hence vapor pressure, differences between the wall 
and the liquid. Thus, depending on whether the wall is warmer or colder, there 
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will be either rapid evaporation from the wall layer or the formation of droplets 
that will fall back into the liquid. In Hell, however, the abnormal heat con- 
ductivity prevents the establishment of any temperature differences, and as a 
result the layer creeps up the side and over the top. 


Landau Two-Fluid Theory: We have seen that both the viscosity experiments 
and the thermomechanical effect can be explained (crudely) by a two-fluid model 
for liquid helium II. A phenomenological theory using the two-fluid concept 
was introduced in 1940 by Tisza* and in another form in 1941 by Landau.f 
Since Landau’s view is the deeper one, we shall concentrate on his formulation. 

Consider HelII to consist of a perfect background fluid (which has zero 
entropy and viscosity), and some type of excitations, which for the moment 
may be regarded as phonons. This simple hypothesis explains a great many of 
the superfluid properties. (The model is somewhat analogous to a solid con- 
sisting of a background lattice plus phonon excitations.) The excitations accord- 
ing to the Landau view are the normal component. 

First, the specific heat of a “phonon gas” is proportional to T° at low 
temperatures in agreement with Fig. 11.1. Second; as the perfect background 
fluid flows through a superleak the phonons are inhibited because they collide 
with the walls. Thus the perfect fluid emerges with no excitation and hence 
zero entropy. The two-fluid model is further strengthened by the following 
ingenious experiment due to Andronikashvili.t A pile of closely packed discs is 
rotated in a Hell bath. The superfluid component is unaffected, but the phonons 
(and any other excitations) are dragged around with the discs and have an 
inertial effect, which can be measured. In this way the “density” p, of the 
inertial or normal component can be measured. (Fig. 11.5). 

The two-fluid concept suggests that the two components can oscillate out 
of phase in such a way that the total density of HelI at a point is essentially 
constant but the difference or ratio of the superfluid and normal components is 
not. The density of excitations, however, is a function of temperature, and hence 
as p,/p Varies So must the temperature. This leads to a new type of wave propaga- 
tion known as “second sound.”’ Second sound is a temperature wave and will 
be excited by heat rather than pressure pulses. 

According to the Landau view, one can think of second sound as a density 
wave in a phonon gas. If the phonon velocity near T = O is c (c is, of course, the 


* Some of Tisza’s ideas can be found in his following papers: Nature, 171, 913 (1938); 
J. Phys. et Radium, (8) 1, 164, 350 (1940); Phys. Rev. 72, 838 (1947). 

+ L. D. Landau, J. Phys. USSR 5, 71 (1941). See also: L. D. Landau and E. M. 
Lifshitz, Fluid Mechanics, Addison-Wesley, Reading, Mass., 1959, ch. XVI; and 
L. D. Landau and E. M. Lifshitz, Statistical Physics, Addison-Wesley, Reading, Mass., 
1959, ch. VI, §§66—67. 

t E. L. Andronikashvili, J. Phys. USSR 10, 201 (1946); Zh. Exsperim. i Teor. Fiz. 18, 
424 (1948). 
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Fig. 11.5 Andronikashvili’s experiment. 


l 2 3 


Fig. 11.6 Velocity of second sound. 


velocity of first sound), the ordinary theory of sound propagation predicts that 


the velocity of second sound c, = clV 3 as T—>0. (See also Section 9.2.) 
Tisza associated second sound with the superfluid component rather than the 
normal (phonons), and as a result he predicted that c > 0Oas T—->0. Fora 
while there was a controversy on this point, but experiment soon proved Landau’s 


supremely confident prediction that c, > clV 3 to be essentially correct. (Actually 


Cc, > cl 3 at very low temperatures, less than 0.5°K.) The actual curve of 
second sound vs. temperature is given in Fig. 11.6. 

To explain this curve between the j-point and 1°K, it must be assumed that 
there are other excitations besides phonons. Landau derived the excitation 
curve of Fig. 11.7 empirically; it has since been developed theoretically. At the 
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Fig. 11.7 Excitation curve for phonons and rotons. 


lowest temperatures, the mean free path of the phonons becomes large; so it 
becomes difficult to measure the speed of second sound. 

At low temperatures, E(P) = cP and the excitations are phonons. In the — 
region around Pp, 


_ 2 
E(P) = A + P= Po) 
ay 


where A is a constant and yp is some effective mass. Landau called these excita- 
tions “rotons.” 

The two-fluid model of Landau can be put on a more quantitative level by 
considering the statistical mechanics of a phonon-roton gas. 

The free energy is 


d>p 


F = kTIn(i — e~ Ber 
( ) (2nh)? 


(11.1) 


It is clear that the low-energy excitations (phonons) and the excitations around 
Po (rotons) contribute the most to the integral. Well below 1°K, however, the 
contribution due to the phonons dominates. Above 1°K, the rotons dominate. 
The average energy and specific heat due to the phonons (E = Pc) is 
found as follows: The expectation value of the number of phonons of energy E is 


Yoel 1 
yar e7 B(nE) eBE _ | ePclkT _ : 


\. Pe 4nP?dP  4n°k*T* 16n°k*T 3 
Eon = a kee ees SS — en ner 


ee ee = bE 
ePcikT _ 4 (2rh)? 15h? 15h3c3 Ce 
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The specific heat due to the rotons (E = A + (P — P,)?/2y) is 


Crot = aia T 24S amie Gar are e : (11.3) 


To understand the Andronikashvili experiment and the meaning of p,, we must 
consider the statistical mechanics of a moving phonon-roton gas. Perhaps the 
simplest way to proceed is to assume that the superfluid background component 
is moving with velocity V,. Then the energy of an excitation, phonon or roton, 
is given by 

E= E(P) + P- V,. (11.4) 


A similar equation was derived for superconductors. An easy way to see this 
relation is as follows. Suppose a gremlin floating in the fluid gives it a kick and 
produces an excitation of momentum P. Then the velocity of the rest of the 
fluid is decreased by P/M, where M is the mass of the background fluid. If 
the fluid winds up with velocity V, plus the excitation, it must originally have 
had velocity V, + P/M, along with the energy necessary to create the excitations. 
The total energy must therefore be 4M(V, + P/M)* + E(P) » 4MV?2 + 
P-V,+ E(P). The energy the fluid has if it moves at velocity V, without an 
excitation is 4MV2. Therefore the energy necessary to excite the fluid without 
changing its velocity is E(P) ++ P:V. Now, applying ordinary statistical 
mechanics, we find that the expectation value of the number of phonons of 
momentum P is 
N 1 


p Q(E(P)+P-Vs)/kT _ 4’ 


(11.5) 


The total momentum density is evidently given by pV, + <P) where 


(Py = | geentee a3P 


(E+ P-V5)/kT aa (2nh)> 


, E/kT 3 
= — Cee LO) me + higher order in V, 
(eF/KT = 1)? (2nh)° 


By definition 


_ [ee d3P (1.7) 


(eE/kT = 1)? (27h)° ° 


In other words, from the point of view just adopted, p, is a derived con- 
cept and is not the density of anything. It is clear from Eq. (11.6.) that 
(p — p,)V, = p,V, where p, is the density of superfluid. 

As in Section 9 of Chapter 10, we can consider the case of an unstable 
state by minimizing E — TS, subject to the condition that there is a fixed 
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expectation value of the momentum. In other words, now the excitations can 
be forced to drift. The result is 
1 


P eBLE(P)—P-Vn+P-Vs] ay | 


and <P) v = DAV a V,). 


11.2 SUPERFLUIDITY—AN EARLY APPROACH 


We saw in the previous section that the Landau two-fluid theory and the 
empirically derived excitation curve (Fig. 11.7) explain a great many of the 
superfluid properties of liquid helium. Thus, a deep understanding of super- 
superfluidity can be obtained by a theoretical deduction of the excitation curve 
and of course, the transition. In particular, we must explain: 


1. Why there are so few excitations at low energy—which is the central 
feature of superfluidity. 


2. Why the excitation curve has its particular form. 
3. Why.there is a transition, and 
4. The quantitative nature of the transition. 


The Transition: Proof that superfluidity is not explained by quantum-hydro- 
dynamics. Although the answer to why there is a transition has been known 
for some time,* this problem has a rather curious history. The Einstein-Bose 
condensation (Section 9 of Chapter 1) predicts that an ideal Bose gas at the 
density of liquid helium will undergo a sharp transition at 3.2°K, which is 
remarkably close to the A-point, 2.18°K. Liquid helium* obeys Bose statistics, 
but is of course not an ideal gas. Nevertheless, it had long been felt that the 
A-point transition is some sort of Bose-Einstein condensation, and as will be 
shown later, this is indeed the case. 

However, a seemingly alternative explanation of the transition was supplied 
by Landau in 1941. Landau, using a procedure termed the “‘quantization of 
hydrodynamics,” developed a set of commutation relations that seemed to suggest 
that there were an energy gap, phonons, rotons, and so forth. These were never 
deduced by Landau. If, however, it is assumed that the quantization of hydro- 
dynamics gives rise to the excitation curve, it is not unlikely that all of super- 
fluidity including the transition can be explained by “quantum hydrodynamics.”’ 
Furthermore, since quantum hydrodynamics has nothing to do with statistics 
(the way Landau developed it), the validity of Landau’s approach would imply 
that He as well as *He undergoes a transition, and the Bose condensation is 
not the cause of the transition. 


* Feynman, R. P., “Application of Quantum Mechanics to Liquid Helium” in 
Progress in Low Temperature Physics, Vol. 1, edited by C. J. Gorter (1955). Dingle, 
R. B., “Theories of Liquid Helium II,”’ Advances in Physics 1, No. 2, p. 111 (1952). 
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Thus, for a while there were two alternatives purporting to explain the 
liquid-helium transition. The refusal of *He to undergo a transition as lower 
and lower temperatures were attained perhaps threw doubt on the Landau 
hypothesis. However, I think I have proved that quantum hydrodynamics 
does not predict the excitation curve as had earlier been hoped by Landau and 
others. An outline of this proof is as follows: 

We consider a fluid and adapt the Lagrangian point of view. That is, we 
describe the state of the fluid by giving the displacement R(ro, t) of each particle. 
ro is the position of the particle at time ¢ = O and has the effect of labeling or 
distinguishing it. Each particle obeys quantum mechanics, and since the 
particles are distinguishable, quantum statistics need not be considered. It is 
then possible to find a set of commutation relations, and to show rather easily 
that there are a tremendous number of low-energy excitations (i.e., there is no 
energy gap, etc.). Now it is possible to convert the commutation relations from 
the Lagrangian to the Eulerian point of view. In the Eulerian point of view, the 
density, velocity, and so on, are functions of r and ¢t where r is a fixed point in 
space. The result is that the Eulerian commutation relations so obtained are 
precisely those obtained by Landau! 

Thus, according to Landau’s quantum hydrodynamics there are a multitude 
of low-energy excitations, and hence this theory does not explain superfluidity. 
Landav’s error is that by tacitly assuming the particles to be distinguishable, he 
neglected the effects of quantum statistics. 


Theory of Superfluidity: We now want to explain superfluidity from first 
principles, given a collection of N He* atoms at density p 9. The two-body 
interatomic potential is given in Fig. 11.8. It is clear from this figure that the 
helium atoms resemble hard spheres of diameter 2.7 A. Since at normal density 
there is 45 A? per atom the atoms are not tightly squeezed. 

The first question one may ask is, why is He* still a liquid at T = 0, at 
zero external pressure? If we calculate the potential energy of a helium lattice 


Potential (°K) 


Atom separation (A) 


Fig. 11.8 Two-body interatomic potential. 
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and add the zero-point kinetic energy, we obtain the total zero-point energy, 
which is approximately the correct value. However, the kinetic or oscillatory 
energy is so large that the oscillatory amplitude is of the order of a lattice 
spacing. Thus the lattice “‘melts”’ due to the lattice vibration. With the exception 
of He°, no other substance has such a high ratio of zero-point kinetic energy to 
zero-point energy. This is because helium is very light (and hence has high 
natural frequency) and has a saturated outer shell, causing very weak interatomic 
forces. 


11.3 INTUITIVE DERIVATION OF WAVE FUNCTIONS: GROUND STATE 


We now turn to the central feature of HelI, which is the scarcity of available 
low-energy excited states. Rather than look at the Hamiltonian we shall 
‘“‘wave our hands,” use analogies with simpler systems, draw pictures, and make 
plausible guesses based on physical intuition to obtain a qualitative picture of 
the solutions (wave functions). This qualitative approach will prove singularly 
successful. 

Consider the ground state o(R,, R,,..., Ry), where R; is the radius vector 
of the ith atom. Each set of 3N numbers (N radius vectors) will be called a 
“configuration” and can be represented by N points in 3-space, the N points 
representing the positions of the N helium atoms (Fig. 11.9). (For ease of 
illustration, the figure is, of course, two dimensional). For each configuration 
there is an amplitude or number 9, which is high for probable configurations 
and low for improbable ones. We may describe the difference between two 
configurations, A and B, by saying that the atoms “move”’ from configuration 
A to B. The word “move” does not imply any dynamics. 

Now, what can be said about o? First of all, because He* is a Bose- 
Einstein liquid, ¢ must be symmetric. That is, an interchange of two particles 
leaves ~ unchanged. Secondly, in analogy with simpler Bose-system stationary 
states, is real and has no nodes. That is, @ is always positive. 


©) 
m° G 


Fig. 11.9 A configuration of helium atoms in 3-space. 
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To see that ~ can be taken to be real, consider Schrédinger’s equation for @: 
— hy? 
» V; + >, ver) | p = Eg. 
2m Fi i<j 


Clearly, g* satisfies the same equation; so either ig is real or 9 + g* is a 
nonzero real solution to Schrédinger’s equation energy E. 

If there are several degenerate eigenfunctions, first pick one solution, use it 
to construct a real eigenfunction, g,, then consider the space of degenerate 
eigenfunctions orthogonal to @, and repeat the procedure. In this way we can 
obtain a set of real eigenfunctions with energy E. 

To show that @ does not pass through zero if its energy is as low as possible, 
we will show how to find a function of lower average energy Ix», when a wave 
function @ with nodes is given. Since |x» is the lowest possible in the ground 
state, it will show that @ is not the ground state, and therefore that the ground 
state has no nodes. Suppose we fix all the arguments of @ except ¢, and let 
O(X1, X,..., &,...) = ®(€) look like Fig. 11.10a. Then construct a new wave 
function that looks like Fig. 11.10b. This wave function has the same value of 


and since the energy may be expressed as 
(py = LeHod™ R _ §[1/2m 5; (Vip) + E V9? 1d"R 
J 9? d®R fo? d®R 


our new wave function has the same energy as the old. We can now lower the 
energy by smoothing out the wave function (see Fig. 11.11), that is, we can 


(11.8) 


(a) (b) 


Fig. 11.10. To show that a solution with lower energy can always be found, given 
a wave function with nodes. 


Iwas 


Fig. 11.11 A smoothed version of Fig. 11.10b. 
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drastically decrease (V@)” near the node without increasing the ~” very much. 
It follows, by the way, that the state of lowest energy is nondegenerate. If 
it were degenerate, we could, by appropriate choice of c, and c,, make a wave 
function 
W = CP, + C292 


that has nodes, and therefore does not have the lowest possible energy. Thus, 
we can take the state of lowest energy to be real and everywhere nonnegative. 
Figure 11.8 indicates that the helium atoms are almost hard spheres. Thus 
any configuration (e.g., Fig. 11.9) that includes two overlapping atoms gives a 
very small amplitude—call it zero. From Eq. (11.8) we see that the energy is 
high where there are large gradients in gy. Thus we want the ground-state wave 
function @ to vary as slowly as possible. Now consider in Fig. 11.12, the atom 
® to ‘“‘move” about, while the other atoms remain fixed. @ for configuration (b) 
must be very small; otherwise there would be a high gradient as ® moves the 
small distance from (b) to (c). In other words, configurations where two atoms are 
very close are unlikely, and in the ground state the most likely configurations are 
those where the atoms are approximately evenly distributed (Fig. 11.12a). 

Thus we have deduced that @ is symmetric, real, always positive, largest 
when the atoms are evenly spaced, and lowest when there is clumping. An 
expression that satisfies these requirements is: 


ep = exp Ese) | = I] F(r;;) (11.9) 


where r;; is the distance between the i and j atoms, and F is given approximately 
in Fig. 11.13. Fis occasionally approximated by some simple function that fits 
Fig. 11.13, such as F = 1 — a/r;;. Since @ is expressed in terms of the one func- 
tion F, it is natural to ask why a variational principle to determine F is not used. 
The answer is that the integral over all the 3N variables cannot be expressed 
simply in terms of F. 


O 
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Fig. 11.12 a, b, andc. A configuration in which one atom “moves” about while all 
the others remain fixed. 
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a 


Fig. 11.13 The function F of Eq. (12.9). 


Note that the nondegeneracy of the ground state implies that it is a sym- 
metric state for any many-particle system, when the particles are identical. For, 
since the Hamiltonian is invariant under a permutation P of the identical 
particles, then g(PR) will also be a ground state when g(R) is. But since there 
is no degeneracy, we must have 


P(PR) = «,0(R), 


and since (PR) is always positive and real (all @ values are) we have « = 1. 
Thus ¢ is totally symmetric. (Remember that this function might yet not be 
allowed by statistics.) 


Excited States: Explanation of Superfluidity: Again, in the analogy to simpler 
Bose systems, the first excited state wave function, w, must have one node and 
be symmetric. Since g is always positive and J py d‘R = 0, w must be positive 
for half the configurations and negative for the other half. We now come to 
the key argument of superfluidity: The only low-energy excitations are phonons. 
Phonons are excited states of compression, or states involving small displace- 
ments of each atom with a resultant change in density. For the phonons to 
have low energy, the density fluctuations must be over large distances; in other 
words, the phonons must have long wavelengths. The central problem is to see 
why no states other than these phonons can have such low energies. 

Now yw, which we assume is not a phonon state, must be orthogonal to each 
phonon state as well as to g. This means that WY must vary from plus to minus for 
configuration changes that do not alter the large-scale density. If & is to have 
low energy, this change from plus to minus must be very slow, that is, it must 
involve large-scale “movements” of the atoms. But the fact that the particles 
are indistinguishable and obey Bose statistics makes such a movement 
impossible! 
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Fig. 11.14 Two configurations (solid and dotted) that result from large displacements 
(long arrows) of the atoms, can actually be accomplished by much smaller adjustments 
(short arrows) because of the identity of the atoms. (From Reference 1). 


Now let us quote from the first reference cited in Section 11.2*: 


“The function yw takes on its maximum positive value for some configuration 
of the atoms. Let us call this configuration A, and the particular locations of the 
atoms «-positions. We said that the «-positions must be well spaced, so that 
the atoms do not overlap, and further that they are, on a large scale, at roughly 
uniform density. Similarly, let us call configuration B, with atomic positions B, 
that for which w has its largest negative value. Now we want B to be different as 
possible from A. We want it to require as much readjustment over a long dis- 
tance as possible to change A to B. Otherwise & changes too rapidly and easily 
from plus to minus, our wave function has a high gradient, and the energy of 
the state is not as low as possible. 

“Try to arrange things so that A requires a large displacement to be turned 
into B. At first you might suppose it is easy. For example (see Fig. 11.14), in A 
take some atoms in the left side of the box containing the liquid and move it 
across to the other side of the vessel, and call the resulting configuration B. 
One objection to this is that an atom is moved from one side to another, so a 
hole remains at the left and an extra atom is at the right. This represents a density 
variation. To avoid this we may imagine that another atom has been removed 
at the same time from right to left, and the various holes and tight squeezes 
have been ironed out by some minor adjustments of several of the neighboring 
atoms. This movement of two atoms each through a distance equal to the size 
of the vessel, one from left to right and the other from right to left, is certainly 
a long displacement, so B and A seem very different. But they are not. 

“The atoms must be considered as identical; the amplitude must not depend 
on which atom is which. One cannot allow w to change if one simply permutes 
atoms. The long displacements can be accomplished in two steps. In the first 
step permute the atoms you wish to move to those a-positions closest to the 


* Feynman, R. P., Ibid. 
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ultimate position they are to occupy in the final configuration B. This step does 
not change w because all the atoms are still in the same configuration of «- 
positions. Then the change to the B configuration is made by small readjust- 
ments, no atom moving more than half the atomic separation. In this minor 
motion yw must change quickly from plus to minus and the energy cannot be low. 
Because the wave function is unchanged by permutation of the atoms it is 
impossible to get a B configuration very far from the A configuration. No 
very-low-energy excitations can appear (other than phonons) at all. 

‘In the phonon case we consider configurations in which, as w changes sign, 
the density distribution changes. A change in density cannot be accomplished by 
permuting atoms. That is why the Bose statistics does not affect phonon states. 
But it leaves them isolated as the lowest states of the system, so that the specific 
heat approaches zero as T approaches zero according to Debye T° law. This 
is the key argument for the understanding of the properties of liquid helium.’’* 


11.4 PHONONS AND ROTONS 


We have been assuming that large-scale density variations are long-wavelength 
phonons. That this is so can be seen as follows: 

Let p(R) be the density of the helium liquid. Then, Fourier analysis of 
p(R) gives p(R) = Dx Qxe *'* and 


H = Gk + C’K7qh), (11.10) 
K 


which is the Hamiltonian for a collection of 3N independent harmonic oscillators 
(normal modes) each of frequency a = KC. Each mode is quantized and, by 
common usage, if the Ath mode is excited to energy Ex = hKC(n + 4), we 
say that there are n phonons of frequency KC. The ground-state wave function 
for a set of independent oscillators is 


Xes = [] exp a) = xp | -5 E28. (11.11) 


K 2 KC 


The low excited state of one phonon of momentum AK is given by 


pip a Eee eae, 
: 2 K’#K K’'C 2 KC 


However, x,, is not the true ground-state wave function in the case of liquid 
helium, since it does not contain enough short distance detail. For example, 
it doesn’t tell us that two atoms cannot overlap, and so forth. Let @ be the true 


* Feynman, R. P., Phys. Rev. 91, 1291, 1301 (1953); 94, 262 (1954). Feynman, R. P. 
and Cohen, M., Phys. Rev. 102, 1189 (1956). Feynman, R. P., Progress in Low Temper- 
ature Physics (edited by C. J. Gorter), ch. II, (1955). 
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ground state, which takes the short distance factors into account. Then in 
analogy to Eq. (11.12), the true low excited state Wx is given by 


Wk = IkQ’. (11.13) 


We are talking about long-range density fluctuations, and K is small. Since the 
operator p(R) = >; 6(R — Rj), we obtain 


qx = | ecw d°R= yen, (11.14) 
and from Eqs. (11.13) and (11.14) 
WPhonon = (x on) DP. (11.15) 
Small K i 


Actually, we should write in place of Eq. (11.10) 
p(R) = ¥ (ax sin K*R + qx cos K -R). 
K 
Then 


* 2 242 
H = i es + i) + C?K*(k + 4x). 
K \ 2 2 

The singly excited states are Wx = gxgy and Wx = qq. Then the state Wx + iW, 
also represents a one-phonon state, and is equal to (>; e’* *‘)g. Equation (11.15) 
helps us to understand why it is impossible to have as a low energy non-phonon 
excitation a single atom moving through the helium fluid. For the wave function 
of such an excitation is e’***'g, which is not symmetric. Blindly symmetrizing 
this function gives 5°; e’“’ "ig, which represents a phonon excitation. It is 
possible, however, to have a (symmetrized) wave function >; e'“'®o if 1/K is 
of the order of the atomic spacing, a. The energy associated with such an 
excitation is 

E = h*K?/2m ~ h?/2ma?’, 


which is very close to the excitation energy of a roton. 


Rotons: Higher Energy Excitations: We have seen that low-energy non-phonon 
excitations are impossible. In other words, there are no possible long-distance 
movements of the atoms that do not change the density. So now let us try to 
construct a wave function y that goes from plus to minus, without a density 
change, as slowly as possible, but which of necessity involves only short 
movements. 

Again let A be the configuration for which wy takes on its maximum positive 
value, and B the configuration for which w takes on its maximum negative values. 
The positions of the atoms in configuration A will be called «-positions and, in 
configuration B, B-positions. The change from A to B must involve only a small 
displacement of each atom and yet involve as large a total movement as possible. 
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Fig. 11.15 In this configuration the f-sites are between the a-sites. 


We know that the a-sites in configuration A must be evenly spaced, and the 
same for the f-sites in B. These conditions imply that the B-sites are between 
the a-sites as in Fig. 11.15. 

Suppose for a moment that each atom is either on an a- or a f-site, and let 
N, and N, be the number of atoms in each type of position. Then 
yw = (N, — N,)¢ is maximum positive for configuration A, maximum negative 
for configuration B, and passes smoothly from A to B. wW can be expressed 
mathematically by defining a function f(R;) to be +1 if R; is at an a-site and 
—1 if R; is at a f-site. 


= (Zs) 0. (11.16) 


For an intermediate position (that is, one for which R; is between an « and 
B-site), Ww will vary smoothly if f(R;) is taken to vary in some smooth way 
between its extreme values of +1 and —1. 

We now want to use the variational principle to find the best possible f(R), if 


v = Ds(Re = Fo: F = DSR). (11.17) 
The energy integral to be minimized is 
_ J Di 7 /2m) Vahl? + Vil?) d**R 
E) = 2, 11.18) 
= pie aR 


When w = Fo, this integral can be written in an interesting way, which is 
quite general (though we take ¢@ real and also assume ¢ vanishes at infinity). 
Replacing w by Fo in Eq. (11.18) gives us 
Z 
(E> = »» — | IF |7(Vi9)(Vi¢) d°"R + iG *V.9)Q(ViF) d°"R 
i 2m | 


4 | (FV,o)9(V,F*) d?"R + | orIneP aR | 


+ [rire anr}|| Lee d?™R. (11.19) 
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Integrating the first integral on the right of Eq. (11.19) by parts and making the 
substitution Ho = Eoq gives finally 


_ Di ) (h?/2m)|ViF | py d°“R 
J IF ?py aR ) 


e=<E) — Ey (11.20) 


where 
py(R,, eae, Ry) yp (Ri, oa eS Ry). (11.21) 


py is evidently the probability of a given configuration. ¢ is the energy of 
an excitation, and E, is the energy of the ground state. 

Returning now to the specifics of the liquid helium problem, we have F = 
>; /(R;) and we wish to minimize Eq. (11.20) to find the best value for f’ The 
denominator of Eq. (11.20) becomes 


[Fe d>*R EE [srry iG aue aR, | d°R, d°R; 


eo | PRD F(R) G(R, — R,) dR, d?Rz, (11.22) 


where 
pog(Ry — Ry) = SY | 5(R) — Ry) O(R} — Rp)py d2°R’. 
ij 


Here ¢ is normalized so that | py d*“R = N = number of atoms. pg is the 
liquid number density in the ground state and g(R, — R,) is the probability of 
finding an atom at R, per unit volume if one is known to be R,. The numerator 
of Eq. (11.20) can be written as 


i 


2 2 
Y | 2 Wr Pon d?°R = po * | wf(Ry? aR. oes 
am 2m 


Equation (11.20) can now be rewritten as 


h?/2m { |Vf(R)|? d3R 


° = TPRDS(RDg(R, — R,) @R, OR, aa 


By setting the variation de = 0 to minimize ¢ we obtain the integrodifferential 
equation 
2 
- = V7f(R) = e | ate — R’)f(R’) d?R’. (11.25) 
m 


The solution to Eq. (11.25) is f(R) = eR’, 


y= (x om) Q, (11.26) 
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and 
h? K ? 
E= 
2mS(K) 


where S(K) = [en “Ro(R) dR. (11.27) 


S(K) is the form factor for scattering of neutrons from the liquid and can be 
found experimentally (Fig. 11.16). From Eq. (11.27) it is the Fourier transform 
of g(R), which is the probability per unit volume of finding an atom at a distance 
R from a given atom in the iiquid in the ground state. 

Since states of different momenta are orthogonal, the energies of Eq. (11.27) 
are significant not only for K near the minimum but also in the neighborhood 
of this value. Although in the derivation of Eq. (11.27) it has been assumed that 
K ~ 2z/a (i.e., K is large) where a is the atomic spacing, the analysis is valid for 
all K as long as W = (> f(R,))@ is a reasonably good wave function. But we 
see from Eq. (11.15) that for small K, yy = (<; f (R;))@ represents phonons, and 
is a very good wave function. Hence the results of Eqs (11.26) and (11.27) are 
valid over the entire range of K. From Eq. (11.27) and Fig. 11.16 we can 
plot (K). 

The minimum occurs near K = I/a, and the excitations in this region are 
the rotons. 

It is interesting to note that the excitation curve (Fig. 11.17) can now be 
determined directly, by measuring the energy loss suffered by a scattered neutron 
of incident momentum AK. For a given angle of scattering, there will be a 
minimum energy loss, which is the energy e(K) of a single excitation. 


11.5 ROTONS 


A roton can be visualized as a group of children going down a slide. The slide 
is the AB part in Fig. 11.18. After sliding down, the children go around and 
come back to the slide again. 

The returning-flow part is not included in Eq. (11.26) of Section 11.4 for the 
roton wave function 


y= (x on) Psroune (11.28) 


U 


but is derived as follows. When we construct a wave packet out of the rotons of 
Eq. (11.28), we have a localized excitation. This localized wave packet does 
not move, because the group velocity of the roton state is zero as is seen in 
Fig. 11.17 (where de/dk = 0). This localized excitation is associated with the 
momentum AK. When we calculate the quantum mechanical current density j 
for a roton, we find it is not zero: 


j #0. (11.29) 
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S(K) 


K(R’) 


Fig. 11.16 Experimental result for the form factors for scattering neutrons from the 
liquid. 


(From Feynman's theory) 


Energy E (in °K equivalent) 


“\—_ Phonons “— Rotons 


1.0 2.0 3.0 


: = 
Momentum (wave number in A ) 


Fig. 11.17 Experimental and theoretical excitation curves. [Feynman, Progress in Low 
Temperature Physics (1955) ]. 
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Fig. 11.18 Schematic of the movement in a roton. 


——*« OO 


O 


332 Superfluidity 


Fig. 11.19 A wave packet of rotons. 


Fig. 11.20 Conservation of current. 


Then we have the picture shown in Fig. 11.19. We see j # 0 in a localized 
region and j = O outside. This contradicts the Schrédinger equation, since 
the current must be conserved. 

This difficulty is avoided by considering the slow return of current to the 
packet (Fig. 11.20). Mathematically this is achieved by improving Eq. (11.28) 
by writing 


y= (x exp ix ‘Ro + DSR; r)|) Puoma (11.30) 


and finding the best form of f(R; — R,). After the variation we find the best 
form is 


W = ek (: tay ie a ) eau (11.31) 


This form conserves the current. 
As is suggested by Figs. 11.18 and 11.20, the roton is like a classical smoke 
ring, as shown in Fig. 11.21. There is one difference, though. The smoke ring 
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Fig. 11.21 A roton has the same shape as a perfect smoke ring. 


Ww” 


Air flow ----3> 
a> 


Fig. 11.22 The attraction between two opposite vortices of a smoke ring is balanced by 
its motion. 


moves, whereas the roton does not, as has already been mentioned. Because 
two vortices of opposite sense attract each other (Fig. 11.22), there is another 
important difference: In the case of the smoke ring, this attraction is balanced 
by the flow of air through it, so that the ring does not shrink to a point. For 
the roton, since it does not move, the diameter of the ring shrinks down to the 
atomic scale. 

When the atom in position A in Fig. 11.18 goes to position B, the state 
returns to itself. This leads to a condition on K in e*’®, K is of the order of 
1/a where a is the average distance between atoms. This agrees with the curve 


in Fig. 11.20. Thus the roton is associated with a momentum and velocity of 
approximately 


p=hla; v= hima. (11.32) 


When there is more than one roton, they interact. The interaction becomes 
important for high temperatures. Landau and Khalatnikov* calculated the 
attenuation of the first and second sounds from kinetic theory taking into 


account the cross sections of phonon-phonon, phonon-roton and roton-roton 
scattering. 


* For detailed discussion and references, see I. M. Khalatnikov, An Introduction to 
the Theory of Superfluidity, W. H. Benjamin, 1965. 
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11.6 CRITICAL VELOCITY 


Let us find the energy necessary to create an excitation of momentum p. 
Suppose a body of velocity V and mass M creates the excitation and winds up 
moving with velocity V’. We see by conservation of momentum 


MV = MV' + p, 


so that the new kinetic energy of the body is 


2 
1MV’? = 14MV2 — V-p + ave (11.33) 


The energy of the excitation is written as e(p). The excitation cannot be created 
unless 


IMV? > 1MV” + ep), 


SO 


1p Po wp: 
e(p) < V-p ar a V -p (11.34) 
for large M. 

The critical velocity necessary to create an excitation is then derived by 
drawing a tangent to the e(p) vs. p curve as shown in Fig. 11.23. 

The critical velocity calculated from this condition is about 60 m/sec. 

We might say that this is the reason why helium is a superfluid; it is not 
excited for small perturbations. Although this explanation seems to be 
qualitatively satisfactory, excitations occur for velocities of about 1 cm/sec. 
Now we have to reverse our position and look for the reason why liquid He 
can be excited with such a small velocity. 


e( P) 


Critical condition, 
e(p) = Vp 


Fig. 11.23 The critical velocity is at tangent to the e(p) versus p curve. 
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11.7 IRROTATIONAL SUPERFLUID FLOW 


When liquid helium is undergoing a mass motion of velocity V the wave function 
is 


Ww = exp (: . 2; V “R,) Dsecinas (11.35) 


Note the difference between Eq. (11.35) and the roton or the phonon wave 
function of Eq. (11.15). The exponential part in Eq. (11.35) may be interpreted 
as follows. 


P+Rog. = Nm (5 RIN) = mV->, R,, (11.36) 


where P is the momentum of the whole system, Rog _ the center-of-mass 
coordinate, and N the number of particles in the system. 
As in the case of superconductivity, for a nonuniform velocity we assume 


y = exp [ y ac ae (11.37) 
and the velocity field V(R) at R is found from @(R) by using 
v(R) = ” vorr). (11.38) 
m 
We see from Eq. (11.38) that 
V x V(R) = 0, 


so that when the system is singly connected there is no rotation. When the 
system is doubly connected, we can have a circulating Vin the system as pictured 
in Fig. 11.24. We take | 

0 = ka. 


For this choice of 0, V(R) is perpendicular to the radius and has magnitude 
hk 
[VJ = —. 
mr 


Cc: 


Fig. 11.24 Circulating V in a doubly connected system. 
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k is restricted to the integers by the requirement that the wave function must be 
single valued. 


@ = no 
Vig, *(11.39) 
mr 
and 
bY-as = on fn 
m 


The unit of quantization is 


2n oe 2n x 1.5 x 107* cm?/sec. 
m 


What is the energy for this flow? If po is the fluid density in atoms per cc, the 
kinetic energy is (for n = 1) 


b h 2 
KE = | Pom (|) 2xr drL, (11.40) 


F mr 


where a and 5b are the radii as shown in Fig. 11.24, and L is the depth of the 
liquid. Thus: 


22 
Line energy per unit length = pox i In () = 10°? In () ergs/cm, 
m a a 


(11.41) 


where we used py = (3.6)"> A® = ae A®. If we neglect atomic structure and 
assume a Classical continuous liquid with surface tension, a unit line makes a 
hole. opposed by surface tension that we can calculate to be only 0.4 A in radius. 
That means that there is no real hole in the liquid. 

The energy associated with a vortex line can be estimated from Eq. (11.41). 
The lower limit a of the integral in Eq. (11.40) is considered as a length of the 
order of atomic spacing, because within about the atomic spacing the velocity 
formula is meaningless (and also the density is low near the center of the vortex). 
The exact determination of a would require us to solve the difficult quantum- 
mechanical problem. In almost all applications the ratio b/a will be very large, 
and the logarithm large enough to be insensitive to the exact value of a. For 
this reason we will not attempt a detailed evaluation, but will simply choose a 
to be close to the atomic spacing. We may arbitrarily take a = 4.0 A. 


* The quantization of a vortex line was first suggested by L. Onsager, Nuovo Cimento 
6, Suppl. 2, 246 (1949). 
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11.88 ROTATION OF THE SUPERFLUID* 


Let us consider the state of rotating liquid helium in a can (see Fig. 11.25). 
Operationally this state is defined by starting from solid helium in the can 
(under pressure > 25 atm at 0°K). First rotate the solid helium and then release 
the pressure to melt it; the liquid helium is then rotating with the angular 
momentum initially given to the solid. What is the final state of helium? We 
ask then for the lowest state of this rotating helium. 

For a system of given angular momentum, the kinetic energy is the least if 
the angular velocity w is a constant throughout the liquid. However, this motion 
is not rotation-free, because V x V = 2a, and a high energy (see Eq. 11.41) is 
required in order to set any small part of liquid helium into a rotational state. 
If we consider the helium to be a rigid body rotating, we get a reasonable energy 
for the rotation, but we cannot consider helium to be rigid. The energy to clamp 
the atoms into a rigid relationship with one another is very high, and must be 
taken into account if one wishes to calculate the energy for uniform rotation, 
with V x V, #4 O everywhere. A simple way to check the validity of the above 
reasoning is to consider two atoms of mass m in a harmonic potential. The 
energy of excitation of one of the atoms is 


1/2 
hoa =h (.) ; 
m 


But if the atoms could be rigidly held together, that energy would become 
h(k/2m)*'?, which apparently would mean that the presence of a second atom 
allows a lower energy of excitation. This paradox is resolved by noting that it 
takes energy to hold the atoms together. If only a limited energy is available, 
nearly all the parts of the fluid must be frozen out in their ground states. That 
is, nearly everywhere the local angular momentum is zero, so that, V x V = 0 
except at certain discontinuities. Suppose we assume that the angular momentum 
is carried by excitations. That is, the liquid is not free of excitations such as rotons 


Fig. 11.25 Rotating can of helium. 


* R. P. Feynman, chapter II in vol. I Progress in Low Temperature Physics, edited 
by C. J. Gorter (1955). 
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and phonons even when the temperature is 0O°K. The number and type of 
excitations is to be found by minimizing the energy for the given value of angular 
momentum. This energy, for vessels of centimeter dimensions turning at about 
one radian per second, turns out to be nearly 10* times the energy of a rigid 
body rotating at the same angular velocity. We must reject such a model. 

Thus, in order to achieve a state of lower energy for the given angular 
momentum, we examine velocity fields that are not everywhere continuous. 
As we saw in Section 11.7, we know that if there is a hole at the center of the 
liquid, circulation can exist. Thus a solution is suggested: the liquid forms a 
vortex around a hole with constant circulation as analyzed in Section 11.7. 
The velocity varies inversely as the radius, rising to such heights near the center 
as to be able to maintain the hole free of liquid by centrifugal force. The energy 
for this state is still quite a bit higher than the kinetic energy for the rigid body 
case, because the velocity instead of being distributed proportionally to the 
radius (the solid case), actually falls as the radius increases (see Eq. 11.39). 
Nevertheless it is orders of magnitude below the continuous V-field with excita- 
tions as suggested above. 

Once we admit the possibility that V is not necessarily continuous we can 
think of a state of lower energy. Suppose that the liquid has not only one vortex 
at the center, but several vortices. For example, suppose beside the central one 
there were a number distributed about the circle of radius R/2, half that of the 
vessel R, and all turning the same way as shown in Fig. 11.26. If the number of 
vortices at R/2 is large, this state is like a vortex sheet so the tangential velocity 
can jump as we pass from inside R/2 to outside as shown in Fig. 11.27. The 
gain in energy resulting from this improved distribution may more than com- 
pensate for the energy needed to make the additional holes (and, further, the 
central vortex need not now be so large and energetic). 

Continuing in this way with ever more vortices we soon notice that the 
energy can always be reduced if more vortices form. However, there is a limit. 
Due to the quantization of the vortex strength, the smallest vortex has cir- 
culation 2xh/m. The lowest energy results if a large number of minimum-strength 
vortex lines (which we will call ‘‘unit lines’) form throughout the fluid at nearly 


Fig. 11.26 The vessel as seen from above, with vortices all turning the same way. 
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Tangential 
velocity 


Radial distance 


R/2 


Fig. 11.27 Discontinuity in tangential velocity at R/2. 


uniform density. The lines are all parallel to the axis of rotation. Since the curl 
of the velocity is the circulation per unit area, and the curl is 2m, there will 
be (by Stokes’s theorem) 

m 


20 ——- = 2.1 x 10% lines/cm? (11.42) 
2th 


with w in radians/second. For w = 1 radians/second the lines are about 
0.2 mm apart, so that the velocity distribution is practically uniform. 

The energy associated with one vortex line is derived from Eq. (11.41). 
Taking a = 4.0 A and b = 0.2 mm, and using Eq. (11.42) we find 


Total line energy per unit volume = pywh In () = 14p mh. (11.43) 
a 


Thus the total line energy in the rotating liquid of radius R and a unit thickness 
is 149,whnR*. The kinetic energy for a rigid rotating body of radius R and 
unit thickness is 7p )R*w*/4. The ratio of the two is of order 


him 7 | 
tae. x 142 10°? (11.44) 


(the number 10? is calculated for R = 1 cm and w = | rad/sec). For macro- 
scopic laboratory dimensions the excess energy to form the lines is small. 
They would form if rotating solid helium is melted by releasing the pressure, the 
angular velocity distribution would differ imperceptibly from uniformity, and 
the surface should appear parabolic. 


11.9 A REASONING LEADING TO VORTEX LINES 


Suppose liquid helium is separated into an upper and a lower part by a friction- 
less surface, and the upper part is moving with velocity V (see Fig. 11.28). 
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(Upper part) ————_e» V exp“ i “= * RY ea: 


(Lower part) 1X Oop 


Fig. 11.28 A frictionless surface separating moving liquid helium from stationary 
liquid helium. 
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a Swe No surface tension here 


Fig. 11.29 Discontinuities at the dividing surface. 


ah Se oe 


Fig. 11.30 Vortex lines. 


In the upper part the wave function contains the factor 
exp (5 > Ri). 


When the dividing surface is removed there appears the surface free energy or 
the surface tension. Because at periodic points exp(im/h)vR = 1, thereisno surface 
tension at these points. The dividing surface is pictured in Fig. 11.29: around a 
discontinuity there is a circulation. Now you may ask, what is the best size of 
the slot for discontinuity? A calculation shows that the discontinuity is not the 
shape of a slot, but a line, so that the dividing surface would look like Fig. 11.30. 
Thus, this reasoning may lead to the assumption of the existence of vortex lines. 

Consider what would happen if liquid is flowing out of an orifice into a 
reservoir of fluid at rest. If the flow is irrotational, it looks like Fig. 11.31. A 
very high velocity develops near the corners and there are large accelerations 
there. An ordinary fluid, such as water, flows in a more complicated manner. 
For a rough estimate, let us suppose that the tube is a long narrow slot, and 
suppose the fluid tries to go out in a jet, at first of the same width and velocity 
as in the tube. Then outside this tube we have a situation similar to that shown 
in Fig. 11.28 and following the same reasoning as before, we expect to see the 
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| 
| 


Fig. 11.32 Vortex lines outside an orifice. 


vortex lines shown in Fig. 11.30 or in Fig. 11.32. Integrating over the dotted 


line we find 
V, = pv sar = n(n). 
m 


where v is the number of vortex lines in length |. The number of vortices per 
unit length along the line of flow is 


V V 
eee see (11.45) 
(2nh/m) 2h 
These vortex lines have an associated energy. From Eq. (11.41) the energy 
per unit length of the vortex line may be written as 


3 | 
~ 1p il In 6 , (11.46) 


m a 
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where p is the density of liquid helium, d is the width of the jet in Fig. 11.32, 
and a is the spacing between atoms, roughly 4 A as was stated at the end of 
Section 11.7. The vortices must move out at velocity V/2, the velocity at the 
location of the vortices. Thus the energy of vortices created per unit time per 
unit length of slot is 


2 2 
Uy: Beate ee in 2); ea 
2 2th m a 
In order to form the vortex lines, the kinetic energy of the jet per unit slot 
length must be greater than Eq. (11.47). The kinetic energy of flow coming 
out of the orifice per unit time per unit slot length is 


2 
Bay, ,- Vd. (11.48) 
The critical velocity is obtained by equating Eq. (11.47) and Eq. (11.48): 
i= a In (‘) ; (11.49) 
dm a 


For velocity greater than vo, there is enough energy in the flow to create vortices. 
To get a rough idea, if we use d = 10~° cm, so that In (d/a) = 6, we see 


Vo = 100 cm/sec. (11.50) 


The experimental value is about 20 cm/sec. The difference suggests that the 
model we have used is oversimplified. 

There is a connection between the concepts of vortex lines and rotons. 
Starting from the vortex lines as shown in Fig. 11.32 we see that they will 
gradually change into rotons as shown in Fig. 11.33. A vortex ring can be 
broken into two smaller rings. A roton may be regarded as a small vortex ring, as 
discussed in Section 11.5. 

There are experimental verifications of vortex lines. See, for instance, 
W. F. Vinen and H. E. Hall, Proc. Roy. Soc. 238, 204 (1956); 238, 215 (1956): 
H. E. Hall, Proc. Roy. Soc. 245, 546 (1958): W. F. Vinen, Nature 181, 1524 (1958). 
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Fig. 11.33 Vortex lines changing into rotons. 
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11.10 THE 4 TRANSITION IN LIQUID HELIUM 


When we know the curve for E(p), the energy versus momentum function, then 
the partition function and the specific heat can be calculated if we assume each 
excitation is independent. This calculation leads to the curve of the specific 
heat, which agrees with the experiment for the part drawn in solid in Fig. 11.34. 

The above approach, however, does not explain the existence of the transi- 
tion shown by the dotted curves in Fig. 11.34, for the interaction between 
excitations is not taken into account. 

For the region near the transition point the E(p) curve in Fig. 11.34 is not 
of much help. We start afresh and use a different method. 

In terms of path integrals we have 


orm t | {| a qin »» set + D Vln - lt iu) 


xi(O)= xj 
xi(U) = px; 


Dx, Dxs**? Dds dx, =e dy (11.51) 


As we concluded in Section 4.1 for high enough temperatures only the identity 
permutation counts, and we get approximately the classical partition function 
of helium. In this case, “high enough temperatures” means (1/f) [% (mx?/2) 
is large for x(0) = position of one atom, x(U) = position of another atom. If 
d is the average interatomic distance, x is on the average d/u and (1/h) |% (mx?/2) 
is of order mkTd*/2h?. For T > 2°K, 


and permutations cannot be important in Eq. (11.51). For low T (high U) 
permutations are important. Furthermore, the approximation that 
V[xu) — x,(u)] is independent of u for important paths is true only if U is 
small, so that x(u) ~ x(Q) to have an appreciable contribution to the path 


C 


Fig. 11.34 Specific heat of liquid helium; theory and experiment agree for the part 
shown as a solid line. 
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integral. For low temperatures, x(u) can vary greatly without giving rise to a 
large value of X*(u), and thus V(x; — x,) can vary over the path. 

To calculate the J transition, we must concern ourselves with what happens 
at low temperatures. We can visualize each path of the integral as a motion of 
every particle of the liquid from one position to another. “‘u”’ is thought of as 
time, and x,(u) is the position of the ith particle of “‘time” u. If, during this 
motion, x,(u) becomes too large, or if V becomes too large, the path gives only a 
small contribution. V[x,(u) — x,(u)] is large if at “time” wu particles i and j are 
close to each other. Of course we are not claiming that x,(u) is really the position 
of some atom at an actual time. But thinking in the manner described above 
allows one to apply physical intuition effectively. 

Consider a permutation in which a particle at position A moves to the 
position B, while the particle at B moves to a third position. (See Fig. 11.35.) 
We will consider the contribution to (1/4) © { (mx?/2) caused by A. 

As A moves to position B, it must somehow avoid passing through particles 
such as C. To get to position B in time U, particle A must move quickly enough 
to cover the distance. Since x is large for particle A, it is increased a large 
amount by increments of velocity so A does not have time to avoid C by trying 
to move around it. Instead, C is jostled out of the way. Suppose, now, that the 
distance from A to Bis r, and suppose that there are n particles that have to be 
pushed out of the way. nis of the order of r/d ,where dis the interatomic spacing. 
Each particle along the path has to move a distance of order d’ to avoid being 
too close to particle A. Particle A is close to particle C for a time of order u/n, 
so that particle C must move with velocity d’/(u/n) during that time, giving 
contribution 


mf{d'\?u — md'?n 1 fey imx? 
— -= to —-} ¥—. 
2h\u/n} n 2hu hjo i 2 


Fig. 11:35 Particle at A moves to B while particle at B moves elsewhere. 
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Thus the n particles give contribution 


md'?n?__ om [(d'\ r? 
2hu 2h? ( d Bo 
Particle A itself contributes mr?/2h7B. Thus the effect of the potential is to 
change e7 "7/2#? to e~ (— ymr?/28B) Where y = 1 + (d’/d)?’. 

We are going to proceed as if y were independent of r and f, and our 
reasoning above shows that this assumption is not too unreasonable. But 
remember that for displacements of order d we may not need as much adjust- 
ment, so y may be less if r is small. Also if the velocity is especially high, it may 
be preferable to pass through relatively high potentials, rather than have rapid 
adjustments. Thus again y would be lowered. Let us ignore such details. 

It is convenient to define the effective mass m’ as 


my = m' 


so that we may write e~”/2"*4r?_ Thus in the liquid state, so far as the path 
integral is concerned, a particle behaves like an ideal-gas particle* of mass m’. 
We have arrived at the important conclusion that the interaction among 
particles is “kinetic” rather than ‘‘potential.”” Kinetic interaction varies as 
r?/B, while potential varies as rf (as can be seen by estimating the potential 
contribution for paths that do not avoid high potentials). 
Therefore the partition function for low temperatures has the form 


pit LS m’ 3N/2 
N! P | \2nh?B 


x €xp eo 2 (R; > PR) P(R,, R,, aie '9 R,) d°R, d°R, aoe 
(11.52) 


The last factor comes from the potential-energy contribution of the initial 
configuration. The reason for the factor (m'/22h?B)°>*’? is not obvious, for it is 
not obvious how a change in the effective mass in the path integral will change 
the normalization of the integral. To be complete, we should instead write a 
factor K,(m'/2nh’B)°"’*. For simplicity, we will pay no attention to K,. The 
factor p is clearly small for initial configurations (equal to the final configurations) 
such that atoms overlap. We can, for example, expect results that are 
qualitatively correct by choosing p = 0 if any two R’s are closer than b (with 
“b” roughly 2.6 A) and p = 1 otherwise. 

A permutation among particles can be visualized as a polygon with arrows. 
The example shown in Fig. 11.36 consists of several cycles of length 1 (particles 


* The above results apply only to Bose particles. Fermi particles behave differently. 
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Fig. 11.36 Permutation among particles; here there is one cycle of length 2 and two 
of length 3. 


that are not permutated), one cycle of length 2, and two cycles of length 3. 
Thus, the sum >', in Eq. (11.45) is a sum over all possible polygon patterns. 

A way to.estimate Eq. (11.45) for temperatures near the transition tem- 
perature will be explained here.* For such temperatures, a side (of a polygon) 
longer than the average interatomic distance d is not important. Thus we may 
assume that the sides of all polygons that contribute have length of order d. 

Given a particular configuration (R,, R,,..., Ry) we restrict our permuta- 
tions to be such that R; and PR; are close. In order to get a contribution to the 
integral in Eq. (11.45), p forces the atoms to be roughly uniformly distributed. 
Consider the sum 


m' 2 
X exp PR py (R; — PR;) 
Large shifts in the R; that preserve the uniform distribution effectively permute 
the atoms and thus leave the above sum unchanged, except for the change that 
would result from small shifts in the R;. The effect of smaller shifts in the R; 
that do not permute the atoms can be included by replacing R; — PR; with an 
average interatomic distance, d. For all uniform distributions of atoms, we can 
therefore replace 


2h7B i 


» exp be 2 » (RB: - PR) | 
by 


yy’ exp (- a KD) P) 


P 2h*B 


* For a more complete discussion, see R. P. Feynman, Phys. Rev. 91, 1291 (1953). 
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where >", is the sum over all permutations of a fixed lattice such that the resul- 
ting polygons are made up of lines joining near neighbors, and n(p) is the total 
number of sides in the set of polygons defined by P. We can now take the above 
sum out of the integral in Eq. (11.45) and write 


/ 


m Nie — mn’ d2 Dh 
e-*F = constant x = Oe ee as 
2nh* B p 


In this equation, the difficult problem is the calculation of the sum. So ignore 
the first two factors on the right and set y = e-?/2"*8 We must compute 


ew PF w= SY ynte), 
P 


So we have the following problem: Given a lattice of points, lines can form 
between adjacent points to form polygons. With each line is associated an 
energy (m'd?/2h7B). We wish to calculate the free energy of this system. Let n, 
be the number of polygons with s sides and let C(n,, n,,...) be the number of 
permutations with n, polygons of s sides. Then 


eenee ye COmsmas-- If sn. 
s=2 


N1,N2,..- 


This sum is restricted by the equation 
[e.@) 
y sn, = N = number of atoms. 
s=1 


The restriction on the sum can be eliminated by the usual method of letting 

N vary while assigning to each atom a weighting factor, t (in Chapter 1, ‘‘t”’ 
Bu 

was e?"),. 


QO — ge #9 


> (mma. d9(E m)e(¥ sn) ait 


N15M2, 0... =2 s=2 


: NeW BFw. (11.53) 


From Eq. (11.46) it 1s easy to see that the expectation value of N is 


Ce ee ee _ 1p 49 


, 1. 
dt dt re 


To evaluate Eq. (11.46), we must find a way of approximating C(n,, n,,...). 
We will approximate C by writing 


R 
C(ny,n2,...) =|] ae 
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where R is the number of ways of drawing a polygon with s vertices on a lattice 


of <N > points (with all sides joining nearest neighbors). This approximation is 
based on the following assumptions: 


a. The number of ways of drawing a polygon with s sides is roughly independent 
of the polygons that have already been drawn. The effect of the competition 
between polygons for available atoms is taken into account by the factor 


6¢ 99 


T, 

b. If we allow overlapping of polygons there will not be much error. 
n, small enough so that given n, random polygons with s sides, there prob- 
ably will not be any that are on top of each other. 


Then we can choose n, polygons of length s in about R%/n.! ways. By 
assumption (a), the total number of ways of choosing sets of polygons with n, 
polygons with s sides is 


R&s 
A eer 
s N,: 
Equation (11.46) becomes 


OQ — >> qi Ri! Il ae (yt)s"s a2 y= (R,t)"! Il 2 [R.(yt)*|n, Jn, 


Ny,M2, 26. n,!s=2 a] ny n,! s=2 n,=0 nN, ! 
= exp [R,t] exp X RY v0 
or 
g = —kT [Rs + y Rios |. (11.55a) 

s=0 

t is chosen so that 
CN) = Rut + YY sR,(yt)’. (11.56a) 

s=2 


We now must find R,. If s = 1, we wish to find R,, the number of ways of 
picking one point out of (NV), so Ry = <N). Hereafter, write (N)> = N. In 
general, to find R, we let h, equal the number of polygons with s sides that can 
be drawn starting at a given atom. Then NA, would be the number of ways of 
drawing a polygon with s vertices if we started at any atom; but we have counted 
each polygon s times. Thus R, = N4&,/s. To avoid worrying about what happens 
when we get near the boundary, we simply consider the system to have periodic 
boundary conditions. That is, in two dimensions the system would have the 
topology of a torus. We will now approximate h, for fairly large s assuming 
that the temperatures are such that small s does not contribute much to the 
sums in Eq. (11.55a) and Eq. (11.56a). Suppose / is the number of nearest 
neighbors of each lattice point. Consider the problem of random walk on the 


11.10 The 2 transition in liquid helium 349 


lattice. There are /° equally probable random walks that start on a given atom 
and make s steps. In h,, we wish to consider only those ways in which the walks 
terminate at the origin. The fraction of walks that end up at the origin is in- 
versely proportional to the volume of space in which the walk is likely to end. 
Since the distance from the origin of the typical random walk is of the order 


dy/ s (d = length of each step), the volume in which a typical walk ends is 
proportional to s°/?._ Thus h, is proportional to /‘/s*/? for large s, where the 
proportionality constant is independent of s. Equations (11.55a) and (11.56a) 
become 


g = —NKTc +c Y) we (11.55b) 
eee ap a a (11.56) 


where c is a proportionality constant. For extremely large s, our result above 
for h, is incorrect. If one chain covers a volume comparable to the size of the 
whole system, then h, = /*/N, where 1/N is the probability that a random walk 
will end at the original point (or any other particular point). Then a more 


realistic h, might be 
h, ® (< + _ 1°, 
§3/2 N 


which has the correct behavior for medium and large s. In any case, the sums in 
Eqs. (11.55a) and (11.56a) converge for tyl < 1 and diverge for tyl > 1. At 
reasonably high temperatures y is small and the sum converges. Also, at such 
temperatures there are not too many polygons, so assumptions (a), (b), and (c) 
are not too unrealistic. Thus we expect our results to be qualitatively correct 
above the J point. Equations (11.55b) and (11.56b) resemble the results for a 
Bose gas (see Section 1.8) and lead to an increase in specific heat with decreasing 
temperature. The shape of the specific heat curve is shown in Fig. 11.37. 


Specific 
heat 


Fig. 11.37 Specific heat near the lambda point. 
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This analysis (which is supposed to be good for T > T,) thus explains why 
C increases as T approaches T, from above. This treatment does not lead to the 
discontinuity of the kind shown in Fig. 11.34, probably because of the approx- 
imations made between Eqs. (11.52) and (11.55a). In other words, we expect 
that the volume of the helium atoms described by p in Eq. (11.52) is the cause 
of the discontinuity in the specific heat. Kikuchi and others have written papers 
in which it is indicated that more accurate computations starting from Eq. 
(11.52) would lead to a discontinuity.* | 


Problem: How do these polygon calculations differ from the ones we did 
in Chapter 5 for the Onsager problem? 


* Kikuchi, R., Phys. Rev. 96, 563 (1954). Kikuchi, R., Denman, H. H., and Schrei- 
ber, C. L., Phys. Rev. 119, 1823 (1960). 
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